Field of the Invention

The present invention relates to a family of novel plant genes encoding
polypeptides characterized by their ability to hydrolyze terminal non-reducing
β-D-galactosyl residues from β-D-galactosides. More specifically, a
polynucleotide sequence derived from a cDNA clone designated pZBG2-1-4
(referred to in U.S. Provisional Appln. No. 60/088,805 as pTomβgal 4), which
encodes a specific plant polypeptide named β-galactosidase II, is provided.
Also provided are cDNA clones encoding six other homologous polypeptides,
methods of using these cDNA clones for producing the β-D-galactosidase
polypeptides of the invention, and methods of modifying fruit quality by
employment of a polynucleotide or polypeptide of the present invention.

Background of the Invention 
The most conspicuous and important processes related to post-harvest 
quality of climacteric fruit are the changes in texture, color, taste, and aroma 
which occur during ripening. Because of the critical relationship that 
deleterious changes in texture have to quality and post-harvest shelf-life, 
emphasis has been placed on studying the mechanisms involved in the loss of 
firmness that occurs during tomato fruit ripening. Although fruit softening
may involve changes in turgor pressure, anatomical characteristics and cell
wall integrity, it is generally assumed that cell wall disassembly leading to a
loss of wall integrity is a critical feature. The most apparent changes, in terms 
of composition and size, occur in the pectic fraction of the cell wall (see 
references in Seymour and Gross, 1996). 

Changes known to occur in the pectic fraction of the cell wall during fruit
ripening include increased solubility, depolymerization, de-esterification and a
significant net loss of neutral sugar-containing side chains (Huber, 1983;
Fischer and Bennett, 1991; Seymour and Gross, 1996). The best characterized
pectin-modifying enzymes are polygalacturonase (endo-α(1→4)-D-galacturonan
hydrolase; E.C. 3.2.1.15; PG) and pectin methylesterase (E.C. 3.1.1.11; PME).
Although PG and PME are relatively abundant and have substantial activity
during tomato fruit ripening, softening still occurs, albeit with a slight delay, in
fruit where PG (Smith et al., 1988, 1990) or PME (Tieman et al., 1992; Hall et
al., 1993) gene expression and enzyme activity were significantly
down-regulated in transgenic plants. Moreover, over-expression of PG in the
non-ripening mutant rin tomato fruit did not result in softening even though
depolymerization and solubilization of pectin were evident (Giovannoni et al.,
1989).

Among the other known pectin modifications that occur during fruit
development, one of the best characterized is the significant net loss of
galactosyl residues which occurs in the cell walls of many ripening fruit (Gross
and Sams, 1984; Seymour and Gross, 1996). Although some loss of galactosyl
residues could result indirectly from the action of PG, β-galactosidase
(exo-β(1→4)-D-galactopyranoside; E.C. 3.2.1.23) is the only enzyme identified in
higher plants capable of directly cleaving β(1→4)galactan bonds, and probably
plays a role in galactan sidechain loss (DeVeau et al., 1993; Carey et al., 1995;
Carrington and Pressey, 1996). No endo-acting galactanase has yet been
identified in higher plants. The view that β-galactosidase is active in releasing
galactosyl residues from the cell wall during ripening is supported by the
dramatic increase in free galactose, a product of β-galactosidase activity
(Gross, 1984), and a concomitant increase in activity of a particular enzyme,
designated β-galactosidase II, in tomatoes during ripening (Carey et al., 1995).
β-Galactosidase activity is thought to be important in cell wall metabolism
(Carey et al., 1995). β-Galactosidases are generally assayed using artificial
substrates such as p-nitrophenyl-β-D-galactopyranoside (PNP),
4-methylumbelliferyl-β-D-galactopyranoside and
5-bromo-4-chloro-3-indoxyl-β-D-galactopyranoside (X-GAL). However, it is
clear that β-galactosidase II
is also active against natural substrates, i.e., β(1→4)galactan (Carey et al.,
1995; Carrington and Pressey, 1996; Pressey, 1983). β-Galactosidase proteins
have been purified and characterized in a number of other fruits including
kiwifruit (Ross et al., 1993), coffee (Golden et al., 1993), persimmon (Kang
et al., 1994), and apple (Ross et al., 1994).

Carey et al. (1995) were able to purify three previously identified
β-galactosidases from ripening tomato fruit (Pressey, 1983), but only one
(β-galactosidase II) was active against β(1→4)galactan. Even though they were
able to identify putative β-galactosidase cDNA clones, none of the deduced
amino acid sequences of those cDNAs matched the amino-terminal sequence of the
β-galactosidase II protein. Although β-galactosidase II, a protein present in
tomato (Lycopersicon esculentum Mill.) fruit during ripening and capable of
degrading tomato fruit galactan, has been purified, cloning of the
corresponding gene has been elusive.

The modification of plant gene expression has been achieved by several 
methods. The molecular biologist can choose from a range of known methods 
to decrease or increase gene expression or to alter the spatial or temporal
expression of a particular gene. For example, the expression of either specific 
antisense RNA or partial (truncated) sense RNA has been utilized to reduce 
the expression of various target genes in plants (as reviewed by Bird and Ray, 
1991, Biotechnology and Genetic-Engineering Reviews 9:207-227). These 
techniques involve the incorporation into the genome of the plant of a 
synthetic gene designed to express either antisense or sense RNA. They have 
been successfully used to down-regulate the expression of a range of 
individual genes involved in the development and ripening of tomato fruit 
(Gray et al., 1992, Plant Molecular Biology 19:69-87). Methods to increase the
expression of a target gene have also been developed. For example, additional
genes designed to express RNA containing the complete coding region of the 
target gene may be incorporated into the genome of the plant to "over-express" 
the gene product. Various other methods to modify gene expression are 
known; for example, the use of alternative regulatory sequences. The 
complete disclosure of each of the references cited above is fully incorporated 
herein by reference. 

The need therefore exists to clone a gene for β-galactosidase II and related
polypeptides and, using known methods of modification of plant gene
expression, to provide methods for modifying the quality of fruits,
particularly by modifying the cell wall, thereby directly affecting the ripening
of the fruit.



Summary of the Invention 

The present invention is based on the discovery of novel DNA sequences
derived from cDNA clones from a family of genes encoding β-galactosidases.
The phylogenic tree based on the shared amino acid sequence identities for the
DNA sequences of the present invention is shown in Figure 1A,B. Five cDNA
and two RT-PCR clones, designated herein as TBG1, TBG2, TBG3, TBG4,
TBG5, TBG6, and TBG7 and having the nucleic acid sequences designated
SEQ ID NOs: 1-7, respectively, as shown in Figure 2, were identified which had
a high degree of shared sequence identity to other known β-galactosidases.
The corresponding amino acid sequences are designated herein as SEQ ID
NOs: 8-16, respectively, and are shown in Figures 2 and 3. The nucleotide
sequences for SEQ ID NOs: 1-7 are recorded in GenBank with the following
respective Accession Numbers:



SEQ ID NO: 1    TBG1    AF023847    deposited Sept 10, 1997
SEQ ID NO: 2    TBG2    AF154420    deposited May 19, 1999
SEQ ID NO: 3    TBG3    AF154421    deposited May 20, 1999
SEQ ID NO: 4    TBG4    AF020390    deposited Aug 21, 1997
SEQ ID NO: 5    TBG5    AF154423    deposited May 20, 1999
SEQ ID NO: 6    TBG6    AF154424    deposited May 20, 1999
SEQ ID NO: 7    TBG7    AF154422    deposited May 20, 1999

Throughout the following discussion, wherever TBG4 is indicated in the
description of the invention, it is to be understood that TBG1-3 and 5-7 are
also to be included in that description, unless otherwise indicated.

Also provided is a method of providing a DNA sequence of the invention, either
by cloning a cDNA (for instance, pZBG2-1-4) that codes for a protein of the
present invention, such as β-galactosidase II, or by deriving the DNA sequence
from genomic DNA, or by synthesis of a DNA sequence ab initio using the
cDNA sequence as a guide.

Also provided is a method for modifying cell wall metabolism, which involves
modifying the activity of at least one galactosidase and thus modifying the
quality of the fruit.

Also provided by the present invention is a DNA construct including
some or all of an exemplary β-galactosidase DNA sequence under control of a
transcriptional initiation region operative in plants, so that the construct can
generate RNA in plant cells.

Also discovered is an enhancer/promoter associated with expression of
the genes encoding β-galactosidase.

The present invention also relates to recombinant vectors, which include
the isolated nucleic acid molecules of the present invention, and to host cells
containing the recombinant vectors, as well as to methods of making such
vectors and host cells and for using them for production of β-galactosidase
polypeptides or peptides by recombinant techniques.

The present invention also provides plant cells containing DNA
constructs of the present invention; plants derived therefrom having modified
β-galactosidase gene expression; and seeds produced from such plants.

The β-galactosidase II protein of the present invention has demonstrated
enzyme activity in cell wall disassembly leading to loss of tissue integrity and
fruit softening. The β-galactosidase II protein also may be involved in cell
wall turnover, which could be involved in cell extension and/or expansion and 
therefore plant growth and development. 

By hydrolyzing galactose from the cell wall, the enzyme may allow 
ripening to commence and/or progress, since galactose may be involved in 
stimulating ethylene production alone or in conjunction with unconjugated N- 
glycans. 

The β-galactosidase of the invention may be involved in conversion of
chloroplasts (green - chlorophyll) to chromoplasts (red - lycopene) during 
fruit ripening by degrading chloroplast membrane galactolipids. 

The family of genes represented by the nucleotide sequences shown in 
Figure 2 is expected to code for a group of similar enzymes with the same 
type of hydrolytic activity but with different tissue and/or substrate
specificities or cellular compartmentation profiles.

The β-galactosidase II protein of the present invention as well as other
proteins encoded in the nucleotide sequences shown in Figure 2 may be used
for preparation of pectin and other cell wall derived polymers with lowered
galactosyl content for use in biofilms and solutions (for example in
clarification of fruit juices) requiring lower or higher cross-linking or
viscometric properties.

The present invention also provides β-galactosidase enzymes for use as
components of enzyme mixtures for protoplast isolation.

Brief Description of the Figures 

Figures 1A and 1B show a phylogenic tree based on shared amino acid
sequence identity among tomato β-galactosidase clones TBG1-7 and other
known plant β-galactosidase polypeptides.

Figure 2 shows cDNA sequences [SEQ ID NOs: 1-7, respectively] for the
seven β-galactosidase genes of the invention: TBG1, TBG2, TBG3, TBG4,
TBG5, TBG6 and TBG7.

Figure 3 shows a multiple sequence alignment of the deduced amino acid
sequences of tomato fruit cDNA clones TBG1, TBG2, TBG3, TBG4,
TBG5, TBG6 and TBG7 [SEQ ID NOs: 8-16, respectively] and various plant
β-galactosidase cDNA clones.

Figure 4 shows an autoradiograph of Northern blot analysis of TBG expression in
various plant tissues (flowers, leaves, roots and stems).

Figure 5 shows an autoradiograph of Northern blot analysis of TBG expression
in fruit tissues at different stages of development.

Figure 6 shows an autoradiograph of Northern blot analysis of TBG expression
in fruit tissues (mature green or turning stage fruit peel, outer pericarp, inner
pericarp and locular).

Figure 7 shows an autoradiograph of Northern blot analysis of TBG expression in
normal and mutant fruit tissues.

Figure 8 shows an autoradiograph of Northern blot analysis of TBG expression in
response to ethylene treatment of mature green fruit tissues.

Figure 9 shows Western blot analysis of TBG4 expression by yeast.

Figure 10 shows detection of β-galactosidase activity from pZBG2-1-4
expression in E. coli.

Figures 11A - E (1-4) show the comparative results of texture measurements
for fruit from tomato plants containing antisense constructs to suppress TBG4
mRNA and fruit from the parental line.

Figures 12A - B show Northern blot analysis of TBG4 expression in
transgenic fruit containing the TBG4 antisense construct.

Figure 13 shows a binary construct used to transform plants and express
TBG4 (pZBG2-1-4) in the antisense orientation.

Detailed Description

The following detailed description is directed to a preferred
embodiment of the present invention and is intended as illustrative of each of
the other DNA sequences of the present invention.

The present invention provides isolated nucleic acid molecules
comprising a polynucleotide encoding β-galactosidase polypeptides,
particularly a β-galactosidase II polypeptide having the amino acid sequence
shown in Figure 2. The DNA sequence of the exemplary β-galactosidase II
cDNA clone of the invention, which was determined from a cDNA clone,
pZBG2-1-4, encoding β-galactosidase II, is recorded in GenBank as Accession
Number AF020390. Not all β-galactosidases possess in vitro activity against
extracted cell wall material via the release of galactose from wall polymers
containing β(1→4)-D-galactan. The polypeptide expressed from the exemplary
β-galactosidase II clone, pZBG2-1-4, has been shown to exhibit
β-galactosidase activity and exo-galactanase activity.

The exemplary β-galactosidase II protein of the present invention, as
shown in Figure 2, shares sequence homology with the amino acid sequence
deduced from β-galactosidase cDNA clones of TBG2-7 and cDNA clones of
the fruits of asparagus (accession number P45582), apple (accession number
P48981), and carnation (accession number Q00662), as well as with a
previously published sequence of a tomato β-galactosidase cDNA clone
designated pTomβgal 1 (accession number P48980)
isolated from ripe 'Ailsa Craig' fruit (Carey et al., 1995). The ORF of the
clone TBG1 disclosed herein by the inventors (accession number AF023847)
is nearly identical to the cDNA previously described by Carey et al. As shown
in Figure 2, the shared deduced sequence identity is high among the seven
clones (TBG1-7) and the other published plant β-galactosidases.

BLAST searches of the database also indicated significant shared
sequence identity between domains of the plant β-galactosidases and
mammalian and fungal β-galactosidases; however, little shared sequence identity
was detected with bacterial β-galactosidases.

As shown in Figure 1, the shared amino acid identity of TBG1 and 
TBG3 was high. TBG4 was also very similar to both TBG1 and 3. The amino 
acid sequences of TBG2 and 7 were unique because several regions of amino 
acid insertions appear throughout their sequence (Figure 3). 



Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using a PCR-based 
dideoxynucleotide terminator protocol and an ABI automated DNA sequencer 
(such as the Model 373 from Applied Biosystems, Inc., Foster City, CA), and 
all amino acid sequences of polypeptides encoded by DNA molecules 
determined herein were predicted by translation of a DNA sequence 
determined as above. Therefore, as is known in the art for any DNA sequence 
determined by this automated approach, any nucleotide sequence determined 
herein may contain some errors. Nucleotide sequences determined by
automation are typically at least about 90% identical, more typically at least
about 95% to at least about 99.9% identical to the actual nucleotide sequence
of the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or 
deletion in a determined nucleotide sequence compared to the actual sequence 
will cause a frame shift in translation of the nucleotide sequence such that the
predicted amino acid sequence encoded by a determined nucleotide sequence 
will be completely different from the amino acid sequence actually encoded by 
the sequenced DNA molecule, beginning at the point of such an insertion or 
deletion. 
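
The frame shift effect described above can be illustrated with a minimal sketch. The 18-nucleotide sequence below is invented purely for illustration and is not taken from any sequence of the invention; the translation table is the standard genetic code, which is assumed here to apply.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding strand codon by codon from its first base.

    A trailing partial codon is ignored; '*' marks a stop codon.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical reading frame, invented for illustration only.
actual = "ATGGCTAGCAAAGGTGAA"
# The same sequence as it would read with one spurious "C" after position 6.
misread = actual[:6] + "C" + actual[6:]

print(translate(actual))   # MASKGE
print(translate(misread))  # MAQQR*
```

In this toy case the two predicted peptides agree only up to the inserted base; every residue after it differs, and the shifted frame happens to run into a stop codon.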

By "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U), where each 
thymidine deoxyribonucleotide (T) in the specified deoxyribonucleotide 
sequence is replaced by the ribonucleotide uridine (U). 
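
As a minimal sketch of this definition, the function below derives the RNA sequence corresponding to a given DNA sequence by replacing each T with U. The function name and the input check are illustrative choices, not part of the specification.

```python
def dna_to_rna(dna):
    """Return the RNA sequence corresponding to a DNA sequence (each T becomes U)."""
    dna = dna.upper()
    unexpected = set(dna) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(unexpected)}")
    return dna.replace("T", "U")


print(dna_to_rna("ATGTAA"))  # AUGUAA
```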

Using the information provided herein, such as the exemplary nucleotide
sequence shown in Figure 2 [SEQ ID NO: 4], a nucleic acid molecule of the
present invention encoding a β-galactosidase II polypeptide may be obtained
using standard cloning and screening procedures, such as those for cloning
cDNAs using mRNA as starting material. Illustrative of the invention, the
nucleic acid molecule described in Figure 2 [SEQ ID NO: 4] was discovered
in a cDNA library derived from breaker, turning and pink fruit pericarp from
'Rutgers' tomato plants.

The complete sequence of the cDNA insert of pZBG2-1-4 is accessible in
GenBank (Accession No. AF020390) and is provided in Figure 2 [SEQ ID NO: 4].
The cDNA insert is 2532 nucleotides (nt) long and contains a single, long open
reading frame (ORF) predicted to start with the first in-frame ATG at nt 64
and end with TAA at nt 2238. This ORF codes for a 79 kD protein 724 amino
acids long. The deduced amino acid sequence of pZBG2-1-4 shared
significant amino acid identity with all published plant β-galactosidase
sequences in the database (Figure 1A,B). When the entire ORF of each
β-galactosidase gene was compared to pZBG2-1-4, the shared sequence identity
was about 64% for tomato pTomβgal 1 (P48980), about 67.6% for apple (P48981),
about 63% for asparagus (P45582) and about 55% for carnation (Q00662). As one
of ordinary skill would appreciate, due to the possibilities of sequencing errors
discussed above, the actual complete β-galactosidase II polypeptide encoded
by the deposited cDNA, which comprises about 724 amino acids, may be
somewhat longer or shorter. More generally, the actual open reading frame
may be anywhere in the range of ±20 amino acids, more likely in the range of
±10 amino acids, of that predicted from the first methionine codon from
the N-terminus shown in Figure 2 [SEQ ID NO: 4]. In any event, as discussed
further below, the invention further provides polypeptides having various
residues deleted from the N-terminus of the complete polypeptide, including
polypeptides lacking one or more amino acids from the N-terminus of the
β-galactosidase II polypeptide described herein.

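
The coordinates given above for the pZBG2-1-4 insert can be checked with a short calculation. This sketch assumes 1-based, inclusive numbering and assumes that "nt 2238" refers to the last base of the TAA stop codon; the function name is illustrative.

```python
def orf_summary(insert_len, atg_start, stop_end):
    """Summarize an ORF from 1-based, inclusive nucleotide coordinates.

    atg_start is the position of the A of the initiating ATG and stop_end is
    the position of the last base of the stop codon (an assumption about how
    the coordinates in the text are meant).
    """
    orf_nt = stop_end - atg_start + 1
    if orf_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return {
        "five_prime_utr_nt": atg_start - 1,
        "orf_nt": orf_nt,
        "amino_acids": orf_nt // 3 - 1,  # the stop codon encodes no residue
        "three_prime_utr_nt": insert_len - stop_end,
    }


summary = orf_summary(insert_len=2532, atg_start=64, stop_end=2238)
print(summary)
# {'five_prime_utr_nt': 63, 'orf_nt': 2175, 'amino_acids': 724, 'three_prime_utr_nt': 294}
```

Under these assumptions the stated coordinates give 2175 nucleotides, that is 725 codons including the stop codon, which agrees with the 724 amino acid length stated for the encoded protein.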

Leader and Mature Sequences

Analysis of the deduced amino acid sequence of pZBG2-1-4 suggested
a high probability for secretion based on the presence of a hydrophobic leader
sequence, a leader sequence cleavage site and three possible N-glycosylation
sites. The programs PSORT V6.4 (Nakai and Kanehisa, 1992, incorporated
herein by reference) and SignalP V1.1 (Nielsen et al., 1997, incorporated
herein by reference) were used to predict that the ORF contains a hydrophobic
leader sequence that would be cleaved between the alanine and serine residues
at positions 23 and 24, respectively, and that the mature polypeptide has an
extracellular location. The mature polypeptide contains three possible
N-glycosylation sites at asparagine numbers 282, 459 and 713; however, the
asparagine at position 713 is unlikely to be glycosylated due to the proline at
position 714. The predicted molecular mass of the unglycosylated mature
polypeptide was 75 kD with a pI of 8.9.
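
The reasoning about the proline at position 714 reflects the commonly used N-glycosylation sequon rule: asparagine, followed by any residue other than proline, followed by serine or threonine. A minimal sketch of a scan applying that rule is given below. It is not the method used by the prediction programs named above, and the peptide is invented for illustration.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead keeps the
# match to the Asn itself so that overlapping sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein):
    """Return 1-based positions of asparagines in an N-X-S/T sequon (X not Pro)."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]


# Hypothetical peptide: Asn3 (N-G-S) and Asn13 (N-V-T) qualify, while Asn8
# (N-P-T) is excluded because the next residue is proline.
peptide = "MKNGSAANPTLLNVT"
print(find_sequons(peptide))  # [3, 13]
```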

Accordingly, the amino acid sequence of the complete β-galactosidase
II protein of the invention includes a leader sequence and a mature protein, as
shown in Figure 3 [SEQ ID NO: 4]. More in particular, the present invention
provides nucleic acid molecules encoding a mature form of the β-galactosidase
II protein. Thus, according to the signal hypothesis, secreted proteins have a
signal or secretory leader sequence which is cleaved from the complete
polypeptide to produce a secreted "mature" form of the protein. In some cases,
cleavage of a secreted protein is not entirely uniform, which results in two or
more mature species of the protein. Further, it has long been known that the
cleavage specificity of a secreted protein is ultimately determined by the
primary structure of the complete protein, that is, it is inherent in the amino
acid sequence of the polypeptide. Therefore, the present invention provides a
nucleotide sequence encoding the mature β-galactosidase II polypeptide
having the amino acid sequence encoded by the cDNA shown in Figure 2
[SEQ ID NO: 4] and provided in GenBank (Accession No. AF020390). By the
"mature β-galactosidase II polypeptide having the amino acid sequence
encoded by the cDNA clone shown in Figure 2 [SEQ ID NO: 4]" is meant the
mature form(s) of the β-galactosidase II protein produced by expression in a
plant cell of the complete open reading frame encoded by the cDNA sequence
of the clone shown in Figure 2 [SEQ ID NO: 4] and provided in GenBank
(Accession No. AF020390).

The exemplary β-galactosidase II cDNA of the present invention
(TBG4) has been expressed in E. coli strain XL1-Blue MR (lacZ) (Stratagene,
La Jolla, CA), as described hereinbelow (see Example).

Analysis of the deduced amino acid sequence of cDNA clones
representing the other β-galactosidase genes of the invention also revealed
open reading frames and, in some cases, suggested a high probability for
secretion of the encoded proteins. All the full-length cDNA clones were
predicted to have a signal sequence (Fig. 2). Using the two prediction
programs SignalP and PSORT, TBG4 was predicted to be secreted by both
programs. TBG1, 2 and 3 were predicted to have cleavable signal sequences
by SignalP, but uncleavable signal sequences by PSORT. TBG7 was suggested
to be targeted to the chloroplast by PSORT. Particular observations for each
of the seven clones are as follows, based on the presence of a hydrophobic
leader predicted by the programs PSORT V6.4 and SignalP V1.1: TBG1:
initiation codon at 306 [SEQ ID NO: 1], ORF = 835 amino acids [SEQ ID
NO: 8], signal sequence at 1-24; TBG2: initiation codon not determined [SEQ
ID NO: 2], ORF = 888 amino acids [SEQ ID NO: 9], signal sequence at 1-25;
TBG3: initiation codon at 32 [SEQ ID NO: 3], ORF = 838 amino acids [SEQ
ID NO: 10], signal sequence at 1-22; TBG5: initiation codon not determined
[SEQ ID NO: 5], ORF = 251 amino acids [SEQ ID NO: 12], signal sequence
not determined; TBG6: initiation codon not determined [SEQ ID NO: 6], ORF
= 248 amino acids [SEQ ID NO: 13], signal sequence not determined; TBG7:
initiation codon at 104 [SEQ ID NO: 7], ORF = 870 amino acids [SEQ ID
NO: 14], signal sequence at 1-35.

The deduced amino acid sequences of the seven clones were also
subjected to analysis using the program DNAsis, and the predictions for
molecular mass, cellular targeting, pI and potential N-linked glycosylation
sites are summarized in Table I.

Table I. Tomato β-galactosidase (TBG) cDNA sequence data. Five
full-length and two partial-length cDNAs were cloned and sequenced.
The DNA and deduced amino acid sequence data are presented
below.

CLONE   mRNA (kb)   kD     pI    N-LINK   TARGET
TBG1    3.2         90.8   6.2   2        ER/OUT
TBG2    3.0         97.0   6.2   6        PM
TBG3    2.8         90.5   8.2   1        ER/OUT
TBG4    2.6         77.9   8.9   3        OUT
TBG5    ~3
TBG6    ~3
TBG7    3.0         93.3   8.0   6        CHLOR

N-LINK = possible N-linked glycosylation sites; ER = endoplasmic
reticulum; OUT = secreted; PM = tethered to plasma membrane; CHLOR
= chloroplast



As indicated, nucleic acid molecules of the present invention may be in
the form of RNA, such as mRNA, or in the form of DNA, including, for
instance, cDNA and genomic DNA obtained by cloning or produced
synthetically. The DNA may be double-stranded or single-stranded.
Single-stranded DNA or RNA may be the coding strand, also known as the
sense strand, or it may be the non-coding strand, also referred to as the
anti-sense strand.

By "isolated" nucleic acid molecule(s) is intended a nucleic acid
molecule, DNA or RNA, which has been removed from its native environment.





WO 99/64564 



PCT/US99/12697 






For example, recombinant DNA molecules contained in a vector are
considered isolated for the purposes of the present invention. Further
examples of isolated DNA molecules include recombinant DNA molecules
maintained in heterologous host cells or purified (partially or substantially)
DNA molecules in solution. Isolated RNA molecules include in vivo or in
vitro RNA transcripts of the DNA molecules of the present invention. Isolated
nucleic acid molecules according to the present invention further include such
molecules produced synthetically.

Isolated nucleic acid molecules of the present invention include DNA
molecules comprising an open reading frame (ORF) with an initiation codon at
position 64 of the nucleotide sequence shown in Figure 2 [SEQ ID NO: 4].
Also included are DNA molecules comprising the coding sequence for the
mature β-galactosidase II protein shown at positions 135-2532 of Figure 2
[SEQ ID NO: 4].

In addition, isolated nucleic acid molecules of the invention include
DNA molecules which comprise a sequence substantially different from those
described above but which, due to the degeneracy of the genetic code, still
encode the β-galactosidase II protein. Of course, the genetic code and
species-specific codon preferences are well known in the art. Thus, it would be
routine for one skilled in the art to generate the degenerate variants described
above, for instance, to optimize codon expression for a particular host (e.g.,
change codons in the plant mRNA to those preferred by a bacterial host such
as E. coli). Preferably, this nucleic acid molecule will encode the mature
polypeptide encoded by the above-described deposited cDNA clone.

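
The idea of a degenerate variant can be sketched as follows: codons are swapped for synonymous ones and the encoded peptide is confirmed to be unchanged. The preferred-codon table below is hypothetical and is not real codon usage data for E. coli or any other host, and the 15-nucleotide sequence is invented for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# HYPOTHETICAL preferred codons for an imaginary host (illustration only).
PREFERRED = {"A": "GCG", "K": "AAA", "L": "CTG", "S": "AGC"}


def translate(dna):
    """Translate a DNA coding strand codon by codon from its first base."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def recode(dna, preferred):
    """Replace each codon with the preferred synonymous codon, if one is listed."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    recoded = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        # Guard against a preference table that would change the protein.
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        recoded.append(new_codon)
    return "".join(recoded)


original = "ATGGCTTTATCAAAG"           # encodes M-A-L-S-K
variant = recode(original, PREFERRED)  # synonymous codons substituted
print(variant)                                     # ATGGCGCTGAGCAAA
print(translate(original) == translate(variant))   # True
```

The two DNA sequences differ at four of five codons yet translate to the same peptide, which is the property the degenerate variants described above rely on.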

The invention further provides an isolated nucleic acid molecule
having the nucleotide sequence shown in Figure 2 [SEQ ID NO: 4] or a
nucleic acid molecule having a sequence complementary to the above
sequence. Such isolated molecules, particularly DNA molecules, are useful as
probes for gene mapping, by in situ hybridization with chromosomes, and for
detecting expression of the β-galactosidase II gene in plant tissue, for instance,
by Northern blot analysis.
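
A complementary sequence of the kind referred to above can be derived mechanically from a given strand. The sketch below returns the reverse complement, written 5' to 3' by the usual convention; the function name and the example sequence are illustrative only.

```python
# Watson-Crick base pairing for DNA, preserving letter case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand of a DNA sequence, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGGCT"))  # AGCCAT
```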

The present invention is further directed to nucleic acid molecules 
encoding portions of the nucleotide sequences described herein as well as to 
fragments of the isolated nucleic acid molecules described herein. In 
particular, the invention provides a polynucleotide having a nucleotide 
sequence representing the portion of Figure 2 [SEQ ID NO: 4] which consists 
of positions 1-2538 of Figure 2 [SEQ ID NO: 4]. 

In addition, the invention provides additional nucleic acid molecules 
having nucleotide sequences related to extensive portions of Figure 2 [SEQ 
ID NO: 4] which have been determined from the following related cDNA 
clones: TBG1-3 and TBG5-7 as shown in Figure 3, SEQ ID NOs: 1-3 and 5-7.

In another aspect, the invention provides an isolated nucleic acid
molecule comprising a polynucleotide which hybridizes under stringent
hybridization conditions to a portion of the polynucleotide in a nucleic acid
molecule of the invention described above, for instance, the cDNA clone
shown in Figure 2 [SEQ ID NO: 4]. By "stringent hybridization conditions" is
intended overnight incubation at 42°C in a solution comprising: 50%
formamide, 5x SSC (1x SSC = 150 mM NaCl, 15 mM trisodium citrate), 50 mM
sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and
20 μg/ml denatured, sheared salmon sperm DNA, followed by washing the filters
in 0.1x SSC at about 65°C.
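
The salt concentrations implied by these conditions follow from simple scaling. The sketch below takes 1x SSC as 150 mM NaCl and 15 mM trisodium citrate and computes the composition at the 5x hybridization strength and the 0.1x wash strength; the names used are illustrative.

```python
# Composition of 1x SSC in millimolar, as assumed for this sketch.
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(fold):
    """Return component concentrations (mM) of SSC at the given fold strength."""
    if fold <= 0:
        raise ValueError("fold strength must be positive")
    # Rounded to avoid floating-point noise in the printed values.
    return {component: round(mm * fold, 3) for component, mm in SSC_1X_MM.items()}


print(ssc_concentrations(5))    # {'NaCl': 750.0, 'trisodium citrate': 75.0}
print(ssc_concentrations(0.1))  # {'NaCl': 15.0, 'trisodium citrate': 1.5}
```

The fifty-fold drop in salt between hybridization and washing, together with the higher wash temperature, is what makes the wash step the more stringent of the two.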

As indicated, nucleic acid molecules of the present invention which 
encode a β-galactosidase II polypeptide may include, but are not limited to,
those encoding the amino acid sequence of the mature polypeptide, by itself; 
and the coding sequence for the mature polypeptide and additional sequences, 
such as those encoding the about 1-23 amino acid leader sequence, such as a 
pre-, or pro- or prepro- protein sequence; the coding sequence of the mature 
polypeptide, with or without the aforementioned additional coding sequences. 

Also discovered is an enhancer/promoter associated with expression of
the genes encoding β-galactosidase. The inventors have characterized the
expression profile of TBG2 mRNA and have cloned a lambda genomic cDNA.
TBG2 is expressed before the onset of fruit ripening and continues at a uniform
level throughout all the ripening stages. TBG2 has been found to be expressed in
all fruit tissues and has also been found to be fruit specific. Experiments have 
shown TBG2 to be unaffected by ethylene. TBG2 is expressed in the ripening 
mutants rin, nor and Nr at the normal chronological time after anthesis. The 
promoter discovered would be useful to express any gene in the sense or 
antisense orientation, specifically in tomato fruit, in all tomato fruit tissues, 
starting before and continuing throughout the entire ripening process. The 
promoter could also be used to express any gene in the ripening mutants rin, 
nor and Nr without the need to gas the fruit with exogenous ethylene. 

Variant and Mutant Polynucleotides

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or 
derivatives of the β-galactosidase II protein. Variants may occur naturally,
such as a natural allelic variant. By an "allelic variant" is intended one of 
several alternate forms of a gene occupying a given locus on a chromosome of 
an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985). 
Non-naturally occurring variants may be produced using art-known 
mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions may involve 
one or more nucleotides. The variants may be altered in coding regions, 
non-coding regions, or both. Alterations in the coding regions may produce 
conservative or non-conservative amino acid substitutions, deletions or 
additions. Especially preferred among these are silent substitutions, additions 
and deletions, which do not alter the properties and activities of the 
β-galactosidase II protein or portions thereof. Also especially preferred in this 
regard are conservative substitutions. 

Most highly preferred are nucleic acid molecules encoding the mature 
protein having the amino acid sequence shown in Figure 2 as pZBG2-l-4 or 
the mature β-galactosidase II amino acid sequence encoded by the deposited 
cDNA clone. 

Further embodiments include an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical 
to a polynucleotide selected from the group consisting of: (a) a nucleotide 
sequence encoding the β-galactosidase II polypeptide having the complete 
amino acid sequence in Figure 2 [SEQ ID NO: 4]; (b) a nucleotide sequence 
encoding the mature β-galactosidase II polypeptide shown in Figure 2 [SEQ 
ID NO: 4]; (c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b) above. 
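
The identity thresholds recited above can be illustrated with a short calculation. The following Python sketch is offered only as an illustration and is not the method used by the inventors: it assumes the two sequences have already been aligned to equal length with "-" as the gap character, and it counts identity over all aligned columns, which is one common convention among several. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: identity is counted over every aligned column except
    columns in which both sequences carry a gap ("-").
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1  # a gap can never match here, both-gap was skipped
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True when the pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned 10-nucleotide sequences differing at one position are 90% identical, so they satisfy the "at least 90%" embodiment but not the "at least 95%" embodiment.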

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of 
β-galactosidase II polypeptides or fragments thereof by recombinant techniques. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. 
Retroviral vectors may be replication competent or replication defective. In 
the latter case, viral propagation generally will occur only in complementing 
host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in 
a precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA 
and tac promoters, the SV40 early and late promoters and promoters of 
retroviral LTRs, to name a few. Other suitable promoters will be known to the 
skilled artisan. The expression constructs will further contain sites for 
transcription initiation, termination and, in the transcribed region, a ribosome 
binding site for translation. The coding portion of the transcripts expressed by 
the constructs will preferably include a translation initiating codon at the 
beginning and a termination codon (UAA, UGA or UAG) appropriately 
positioned at the end of the polypeptide to be translated. 
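
The requirement that the coding portion begin with an initiating codon and end with a termination codon can be checked mechanically. The sketch below is illustrative only; it assumes AUG as the initiating codon (the text above does not name one) and uses a hypothetical function name.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}  # termination codons named in the text
START_CODON = "AUG"                  # assumption: the usual initiating codon


def check_coding_region(sequence: str) -> dict:
    """Report whether a coding sequence is framed by start and stop codons.

    Accepts DNA or RNA letters; T is converted to U before checking.
    """
    rna = sequence.upper().replace("T", "U")
    usable = len(rna) - len(rna) % 3          # ignore a trailing partial codon
    codons = [rna[i:i + 3] for i in range(0, usable, 3)]
    return {
        "length_multiple_of_3": len(rna) % 3 == 0,
        "starts_with_start": bool(codons) and codons[0] == START_CODON,
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        # stops before the final codon would truncate the polypeptide
        "internal_stops": [i for i, c in enumerate(codons[:-1]) if c in STOP_CODONS],
    }
```

A construct passing all four checks has a termination codon "appropriately positioned at the end of the polypeptide to be translated" in the sense used above.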

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or 
neomycin resistance for eukaryotic cell culture and tetracycline, kanamycin or 
ampicillin resistance genes for culturing in E. coli and other bacteria. 
Representative examples of appropriate hosts include, but are not limited to, 
bacterial cells, such as E. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 293 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Vectors preferred for use in bacteria include pQE70, pQE60 
and pQE-9, available from QIAGEN, Inc., supra; pBS vectors, Phagescript 
vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available 
from Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 
available from Pharmacia. Among preferred eukaryotic vectors are 
pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable 
vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, 
such as Davis et al., Basic Methods In Molecular Biology (1986). 

Example 

Tomato (Lycopersicon esculentum Mill. cv. 'Rutgers') plants were grown 
in a greenhouse using standard cultural practices. The ripening mutants, 
ripening inhibitor (rin), non-ripening (nor) and never ripe (Nr) (Tigchelaar et 
al., 1978), were all in the 'Rutgers' background. Flowers were tagged at 
anthesis and fruit were harvested according to the number of days 
post-anthesis (dpa) or based on their surface color using ripeness stages as 
previously described (Mitcham et al., 1989), the complete disclosure of which 
is hereby fully incorporated herein by reference. For gene expression studies, 
a variety of leaf, flower, and stem tissues were harvested from 
greenhouse-grown plants and roots were harvested from seedlings grown in basal 
tissue culture medium for 4 weeks after seed germination. 

RNA Extraction 

Fruits were processed immediately after harvest in the greenhouse by 
chilling on ice, excising the various tissues and freezing them in liquid 
nitrogen. Tissue samples were ground using a mortar and pestle and stored at 
-80°C. RNA was extracted using the method described in Verwoerd et al. 
(1989). Poly(A) RNA was purified from total RNA using oligo(dT) columns 
(Pharmacia, Piscataway, NJ). RNA was quantified by measuring A260 using a 
dual beam spectrophotometer. 
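
Quantification from A260 can be sketched as follows. The conversion factor of 40 µg/mL of RNA per absorbance unit at a 1 cm path length is the widely used convention and is an assumption here, since the text does not state the factor the inventors applied; the function name and parameters are likewise illustrative.

```python
def rna_concentration_ug_per_ml(a260: float,
                                dilution_factor: float = 1.0,
                                path_length_cm: float = 1.0,
                                ug_per_ml_per_a260: float = 40.0) -> float:
    """Estimate RNA concentration of the undiluted sample from an A260 reading.

    Assumption: 1 A260 unit corresponds to about 40 ug/mL of RNA at a
    1 cm path length (conventional value, not taken from the source).
    """
    if a260 < 0 or dilution_factor <= 0 or path_length_cm <= 0:
        raise ValueError("readings and factors must be positive")
    return a260 * ug_per_ml_per_a260 * dilution_factor / path_length_cm


def volume_for_mass_ul(target_ug: float, conc_ug_per_ml: float) -> float:
    """Microliters of sample needed to load `target_ug` of RNA."""
    return 1000.0 * target_ug / conc_ug_per_ml
```

Under these assumptions, a sample diluted 1:50 that reads A260 = 0.5 is about 1000 µg/mL, so a 20 µg gel lane would need about 20 µL.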



RT-PCR 

Degenerate primers were designed based on the highest shared deduced 
amino acid sequence identity we found between an apple (accession number 
P48980), asparagus (P45582) and carnation (Q00662) β-galactosidase cDNA 
clones. The two primers used for the first reaction were BG5'E1 
(WSNGGNWSNATHCAYTAYCC) and BG3'E (CCRTAYTCRTCNADNGGNGG). A second 
reaction was done on the products of the first reaction using BG5'I1 
(ATHCARACNTAYGTNTTYTGG) and BG3'R. The degeneracy code for the 
primer sequences is N=a+t+c+g; H=a+t+c; B=t+c+g; D=a+t+g; V=a+c+g; 
R=a+g; Y=c+t; M=a+c; K=t+g; S=c+g; and W=a+t. The 5' and 3' primers 
corresponded to amino acids 72-78 and 321-315 of the apple clone, 
respectively. Amplification was done using AmpliTaq DNA polymerase 
(Perkin Elmer, Norwalk, CT) and standard PCR conditions using the cDNA 
made for the first cDNA library described below as a template (Ausubel et al., 
1987). PCR products were separated in an agarose gel and fragments of the 
expected size (approximately 750 bp) were purified, cloned into pCRscript 
(Stratagene, La Jolla, CA), and sequenced. 
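
The degeneracy code given above determines how many distinct oligonucleotides each degenerate primer represents. The following Python sketch applies that code to the three primer sequences recited in the text; the function names and the expansion limit are illustrative assumptions.

```python
from itertools import product

# Degeneracy code exactly as recited in the text, plus the four plain bases.
DEGENERACY_CODE = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ATCG", "H": "ATC", "B": "TCG", "D": "ATG", "V": "ACG",
    "R": "AG", "Y": "CT", "M": "AC", "K": "TG", "S": "CG", "W": "AT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for symbol in primer.upper():
        count *= len(DEGENERACY_CODE[symbol])
    return count


def expand(primer: str, limit: int = 20000) -> list:
    """List every concrete sequence a degenerate primer can represent."""
    if degeneracy(primer) > limit:
        raise ValueError("primer too degenerate to expand within limit")
    choices = (DEGENERACY_CODE[s] for s in primer.upper())
    return ["".join(bases) for bases in product(*choices)]


if __name__ == "__main__":
    for seq in ("WSNGGNWSNATHCAYTAYCC",
                "CCRTAYTCRTCNADNGGNGG",
                "ATHCARACNTAYGTNTTYTGG"):
        print(seq, len(seq), "nt, degeneracy", degeneracy(seq))
```

The second-reaction 5' primer is far less degenerate than the first-reaction 5' primer, which is consistent with its use in a nested second amplification.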



cDNA library 

Two cDNA libraries were constructed. The first comprised poly(A) RNA 
isolated from breaker, turning and pink fruit pericarp from 'Rutgers' plants. 
The cDNA synthesis and library construction were done exactly according to 
the manufacturer's instructions for the ZAP-cDNA Gigapack II Gold Cloning 
Kit (Stratagene), the complete disclosure of which is fully incorporated herein 
by reference. First-strand cDNA synthesis was primed using a poly(dT) 
primer and inserts were directionally cloned into the Uni-ZAP XR vector using 
EcoRI and XhoI restriction sites. The second library comprised poly(A) RNA 
isolated from all fruit tissues (except seeds) from immature green, mature 
green, breaker, turning, pink, red-ripe and over-ripe fruit of 'Rutgers' plants. 
The cDNA synthesis and library construction were done exactly according to 
the manufacturer's instructions for the Superscript Lambda System for cDNA 
Synthesis and λ Cloning (GibcoBRL, Gaithersburg, MD). First-strand cDNA 
synthesis was primed using an oligo(dT) primer and cDNA inserts were 
directionally cloned into the λ ZipLox cloning vector using SalI and NotI 
restriction sites. Both libraries were amplified and maintained using the host 
strains provided by the manufacturer, according to their instructions. 

One of the clones (RT-PCR2-1) was used to screen 10^6 plaques from the 
tomato fruit cDNA libraries at low stringency (hybridization at 45°C, no 
formamide and final wash with 0.2x SSC at 42°C). Thirty positive cDNA 
clones were identified and partially sequenced. Complete sequencing and 
characterization of the RT-PCR and cDNA clones revealed the possibility of 
seven unique β-galactosidase genes. 

DNA and RNA Gel Blot Analysis 

Southern analysis was done using the 3' UTR of each full length clone 
and the RT-PCR clones as probes against restriction enzyme digested genomic 
DNA. DNA gel blot analysis was done essentially as described in Smith and 
Fedoroff (1995) except that 3 µg of genomic DNA was used for each digest. 
The genes corresponding to the clones appeared to be present as single copies 
(data not shown). The same probes were used to map 6 of the 7 genes using 
RFLPs of recombinant inbred lines and the loci names and map positions are 
shown in Table II (James Giovannoni, Texas A&M University, personal 
communication). 



Table II. TBG loci map positions. Genes were mapped by Southern 
analysis using RFLPs of recombinant inbred lines. 

Gene    Chromosome    Map position 
TBG1    12*           overlap of IL 12-2, IL 12-3 
TBG2    9             IL 9-3 
TBG3    3             IL 3-5 
TBG4    12*           overlap of IL 12-2, IL 12-3 
TBG5    11            IL 11-3 
TBG6    2             overlap of IL 2-4, IL 2-5 
TBG7    no RFLP 

*TBG1 and TBG4 are loosely linked. 



Total RNA (20 µg/lane) was separated in a formaldehyde/Mops 
agarose gel, transferred to Hybond-N+ nylon membrane (Amersham, Arlington 
Heights, IL), fixed by incubating for 2 h at 80°C, hybridized overnight in a 
hybridization incubator (Robbins Scientific, Sunnyvale, CA) using a buffer 
described by Church and Gilbert (1984), washed to a final stringency of 0.1x 
SSC with 0.2% SDS at 65°C, and autoradiographed essentially as described by 
Ausubel et al. (1987). An RNA ladder standard (GibcoBRL) was used to 
estimate the length of the RNAs. Probes were synthesized using a random 
priming kit with 32P-dATP as the label (Boehringer Mannheim, Indianapolis, 
IN). Northern analysis was done using the 3' UTR of each full length clone 
and the RT-PCR clones as templates for probe synthesis. As a loading control, 
RNA blots were stripped and re-probed at a reduced hybridization and 
washing stringency using a soybean 26S rDNA fragment (Turano et al., 1997). 
For all hybridizations, 32P-dATP-labeled probe was diluted to 1-2 x 10^6 
dpm/mL. The complete disclosures of the above references are fully 
incorporated herein by reference. 



Sequence Analysis 

Sequencing was done at the Iowa State University Sequencing Facility 
(Ames, IA) using a PCR-based dideoxynucleotide terminator protocol and an 
ABI automated sequencer (Applied Biosystems, Foster City, CA). The 
sequencing of both cDNA insert strands was done by primer walking. 
Nucleotide and deduced amino acid sequence comparisons against the 
databases were done using BLAST searches (Altschul et al., 1990). Sequence 
data were analyzed and aligned using DNA Strider 1.2 (Marck, 1988) and 
MacDNAsis (Hitachi, San Bruno, CA) software. The complete disclosures of 
the above references are fully incorporated herein by reference. 

Northern Blot Analysis 
Tissue Specific Expression 

Northern blot analysis was done to reveal which, if any, of the 
β-galactosidase genes had a fruit-specific expression pattern. With the exception 
of TBG2, transcripts of all clones were detected in non-fruit tissues (Fig. 4). 
Transcripts of TBG1, 4, 5 and 6 were detected in all the tissues tested. TBG3 
transcript was detected at low levels in root and stem tissues, while TBG7 
transcript was detected in flower and stem tissues. 



Temporal expression pattern in fruit 

The temporal expression pattern of the seven genes in fruit tissue was 
examined using RNA extracted from all fruit tissues except seeds. Transcripts 
for all seven genes were detected during some stage of fruit development (Fig. 
5). TBG1 and 3 had similar expression patterns and their transcripts were 
detected throughout the breaker to over-ripe stages. TBG2 had a unique 
expression pattern and its transcript was detected at a constant level from 30 
dpp to the over-ripe stage. The TBG4 expression pattern was similar to that of 
TBG1 and 3, but differed in that the transcript level was significantly higher 
at the turning stage. TBG5 had a similar expression pattern to TBG4 during the 
ripening stages of development; however, TBG5 transcript was also detected 
throughout all the earlier stages of fruit development. TBG6 had an interesting 
expression pattern and its transcript was only detected at high levels in all 
pre-ripening stages tested. TBG7 also had a unique expression pattern and its 
transcript was detected at very low levels throughout all the stages tested, and 
at moderate levels at 10 dpp and the over-ripe stage. 

Spatial expression pattern in fruit 

Northern blot analysis was also done to determine transcript 
accumulation in various fruit tissues. Since there were temporal differences in 
the expression patterns of the TBG genes, both the mature green and turning 
fruit stages were used for RNA extractions (Fig. 6). Both TBG2 and TBG6 
transcripts were detected in all mature green fruit tissues tested. TBG7 
transcript was present in all fruit tissues tested except for locules. Both TBG1 
and TBG4 transcripts were detected in RNA samples extracted from all 
turning stage fruit tissues. TBG4 transcript was notably more abundant in the 
peel. TBG3 and TBG5 expression patterns were unique and their transcripts 
were detected in all tissues except the outer pericarp and locular tissue, 
respectively. 

Expression in normal versus mutant fruit 

In order to better understand the potential roles of the TBG products 
and transcriptional regulatory mechanisms, northern analysis was performed 
using fruit tissue from the ripening mutants rin, nor and Nr. This analysis was 
important because it might give clues for preliminary determination of any 
potential ripening and/or softening role any of the TBGs might possess. 

The results of mutant fruit Northern analysis suggested that the 
transcriptional regulation of TBG1, 2, 3, 5 and 7 was unaffected in mutant fruit 
tissue and that their transcripts were present in a normal chronological (dpp) 
pattern (Fig. 7). The abundance of TBG4 and 6 transcripts was, however, 
different in the mutant fruit. TBG4 transcript was not detected in fruit tissue of 
Nr and was detected at much lower levels in rin and nor than in wild type fruit 
tissues. Normally TBG6 transcripts are detectable at high levels throughout the 
early stages of fruit development but are not detectable after the mature green 
stage (40-42 dpp). TBG6 transcripts persisted even to 50 dpp in fruit of all 
three mutants. 

Transcriptional regulation by ethylene 

The northern analysis done using mutant and wild type fruit suggested 
that TBG4 expression might be up-regulated by ethylene and that TBG6 
expression might be down-regulated by ethylene. In order to evaluate this 
hypothesis mature green fruit were harvested and subjected to a continuous 
flow of 10 ppm ethylene mixed in air. Control and ethylene-treated fruit were 
used for RNA extractions at 1, 2, 12 and 24 hours. The results of this 
experiment confirmed the findings from the mutant fruit northern analysis. As 
expected, the presence and abundance of TBG1, 2, 3, 5 and 7 transcripts was 
essentially unaffected in mature green tissues subjected to exogenous ethylene 
treatment (Fig. 8). However, TBG4 transcript abundance was increased in 
mature green tissues in the presence of ethylene. From the data presented it 
was unclear whether TBG6 transcript abundance was reduced by exogenous 
ethylene treatment since its transcript level was normally reduced at this stage 
of fruit development. 

Enzyme activity 

In order to determine the role of the TBG encoded products we 
initiated experiments to express the cDNA encoded enzymes using 
heterologous expression systems. Several E. coli expression systems were 
tested, but the yield of product was very low due to toxicity (see the example 
below). Therefore we used a yeast expression system which secretes a mature 
amino-terminal-FLAG fusion protein into the culture medium. The TBG4 
cDNA was tested first and resulted in the production of approximately 1 mg 
of active TBG4 protein per 50 ml of culture. TBG4 was used first because the 
cDNA codes for the enzyme β-galactosidase II, which was purified from 
tomato fruit and has been characterized in some detail (Carey et al., 1995; Smith 
et al., 1998). Therefore we could compare the activity of the heterologous 
system-expressed protein to the native enzyme purified from tomato. The 
TBG4 protein was successfully affinity purified using an anti-FLAG affinity 
resin (Figure 9). 

The affinity-purified TBG4 enzyme was shown to have β(1→4)-D-galactosidase 
activity by virtue of its ability to hydrolyze the synthetic 
substrate p-nitrophenyl-β-D-galactopyranoside (Smith et al., 1998). The 
enzyme can cleave galactosyl residues from a variety of cell wall substrates 
and therefore has exo-galactanase activity (Table III). The remaining 
full-length cDNA clones are currently being tested for successful expression of 
active enzyme. Preliminary results have shown that TBG1 codes for an 
enzyme which also has both β-D-galactosidase and exo-galactanase activity 
(Table III). 

Table III. Cell wall degrading activity of TBG4 and TBG1 expressed 
in yeast. Removal of galactosyl residues from chelator soluble (CSP) 
and alkali soluble (ASP) pectin and hemicellulosic (HCF) cell wall 
fractions purified from tomato fruit. 

                          µg galactose released 
Enzyme      Substrate     Boiled     Live 
TBG4 (a)    CSP           0          5 
            ASP           0          14.5 
            HCF           0          4 
TBG1 (b)    ASP           0          1.2 

2 mg substrate; 4 hours at 37°C 
(a) 0.005 units enzyme/rx 
(b) 0.0005 units enzyme/rx 
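
Because the TBG4 and TBG1 reactions in Table III used different amounts of enzyme, the raw galactose values are not directly comparable. The sketch below normalizes the table values to µg galactose released per unit of enzyme. It is an illustrative calculation only, not an analysis reported by the inventors, and it assumes the TBG1 entry reads 1.2 µg; the names are hypothetical.

```python
# (enzyme, substrate, ug galactose with boiled enzyme, ug with live enzyme, units/rx)
# Values transcribed from Table III; the TBG1 live value is read as 1.2 (assumption).
TABLE_III = [
    ("TBG4", "CSP", 0.0, 5.0, 0.005),
    ("TBG4", "ASP", 0.0, 14.5, 0.005),
    ("TBG4", "HCF", 0.0, 4.0, 0.005),
    ("TBG1", "ASP", 0.0, 1.2, 0.0005),
]


def net_release_per_unit(boiled_ug: float, live_ug: float, units: float) -> float:
    """Galactose released by live enzyme, minus boiled control, per enzyme unit."""
    if units <= 0:
        raise ValueError("enzyme units must be positive")
    return (live_ug - boiled_ug) / units


def normalized_table(rows=TABLE_III) -> dict:
    """Map (enzyme, substrate) to ug galactose released per unit of enzyme."""
    return {
        (enzyme, substrate): net_release_per_unit(boiled, live, units)
        for enzyme, substrate, boiled, live, units in rows
    }


if __name__ == "__main__":
    for key, value in normalized_table().items():
        print(key, round(value, 1), "ug galactose per unit")
```

On this per-unit basis the two enzymes release galactose from ASP at a similar order of magnitude, although the table reports a single reaction condition and supports no stronger conclusion.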




pZBG2-l-4 Codes for a β-Galactosidase 

The TBG4 ORF was cloned in-frame into the repressible/inducible 
bacterial expression vector pFLAG-CTC. The host strain XL1-Blue MR is a 
mutant strain containing no endogenous β-galactosidase activity nor 
α-complementation. Induction of gene transcription by IPTG caused the 
immediate cessation of E. coli growth at 30 to 37°C. However, induction at 
20°C did allow for some limited E. coli growth. When clones containing the 
pZBG2-l-4 ORF were grown at 20°C and induced with IPTG, the cells 
slowly turned blue after 36 hrs growth in medium containing the 
β-galactosidase substrate X-GAL (Figure 10). If not induced with IPTG, no 
blue color was seen, even after extended growth in media containing X-GAL. 
As an additional negative control, clones consisting of XL1-Blue MR 
transformed with the FLAG vector alone never showed any β-galactosidase 
activity with or without IPTG induction, even after 7 days of growth (Fig. 10). 



As a positive control for maximal β-galactosidase (derived from E. coli 
β-galactosidase) activity the cloning vector pGEM was transformed into the host 
strain DH5α and the results are also shown in Figure 10. Figure 10 shows the 
detection of β-galactosidase activity from pZBG2-l-4 expression in E. coli. 
Cells were harvested and extracts were prepared every 12 hours and the A615 
measured. Cultures were grown with the addition of the chromogenic substrate 
X-GAL (open symbols) or X-GAL and the transcriptional inducer IPTG 
(closed symbols) in the medium. The vector used as a positive control for E. 
coli β-galactosidase activity was pGEM (■) and the vector used as a negative 
control and for expression was pFLAG-CTC either without or containing 
the pZBG2-l-4 ORF. 



Effects on Plant Tissue Texture 

To further demonstrate the function of TBG4 encoded β-galactosidase II 
the following experiments were carried out. 

Fruit from tomato plants containing antisense constructs to suppress 
TBG4 mRNA were up to 40% firmer than fruit from the parental line [compare 
means of parental line #1 with antisense line #2 in Figures 11A-11E(1-4)]. 
Among the transformants the line with the firmest fruit also had the lowest 
overall levels of TBG4 mRNA (Figure 12A,B). This correlation suggests that a 
reduction in TBG4 mRNA is associated with increased fruit firmness. Firmer 
fruit might result in (1) less shipping damage, meaning (a) less loss due to 
damage and (b) the ability to harvest at a later stage, resulting in better 
flavor at market; (2) longer shelf life for both market and consumer; and 
(3) better quality fruit for the fresh slice market, since fruit cut better at 
the pink/red stage when firmer. 



Methods 

To determine the function of TBG4 encoded β-galactosidase II, antisense 
constructs were made using the constitutively expressed 35S CaMV promoter 
to express TBG4 antisense RNA (Figure 13). Constructs were moved into 
tomato using Agrobacterium-mediated transformation. Four tomato cultivars 
have been transformed in order to evaluate the effect of TBG4 suppression on 
processing tomato (cv 'UC82b') fruit paste quality and three fresh pick 
cultivars. Of the fresh pick cultivars one is a soft fruit large cherry tomato (cv 
'Ailsa Craig'), the second is a soft fruit old breeding line (cv 'Rutgers') and 
the third is a recently developed somewhat firm cultivar 'New Rutgers'. 
Among the lines where TBG4 mRNA is suppressed we expect to observe an 
increase in firmness and paste viscosity. 
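
An antisense construct places the cDNA in reverse orientation behind the promoter, so that the transcript is the reverse complement of the sense strand. The Python sketch below shows that relationship for an arbitrary DNA string. It is a generic illustration, not the cloning procedure used here, and the example sequence in the usage note is hypothetical rather than taken from TBG4.

```python
# Translation table for complementing unambiguous DNA bases (case preserved).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    invalid = set(dna) - set("ACGTacgt")
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_transcript(sense_dna: str) -> str:
    """RNA expected when the insert is transcribed in antisense orientation."""
    return reverse_complement(sense_dna).upper().replace("T", "U")
```

For a hypothetical sense fragment `ATGGCC`, the antisense-oriented insert reads `GGCCAT` and its transcript `GGCCAU`, which can base-pair with the endogenous sense mRNA.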

Texture 

Although this project is nearly finished, the complete biochemical and 
molecular analysis is not finished. The preliminary results of the analysis of 
the 'New Rutgers' cultivar are presented in Figures 11A-E(1-4) and 12A,B. In 
this example a fresh pick cultivar called 'New Rutgers' was used. Plants of the 
purchased seed were grown and allowed to self and the resulting seed was 
used as the parental control (line 1). Seven independent transformed plants 
(lines 2-8) containing TBG4 antisense constructs were grown and allowed to 
self. Transformation (T-DNA insertion) was confirmed by Southern analysis 
(data not shown). From each transformed line, five plants were grown along 
with 10 parental line plants. Fruit were tagged at the breaker stage (first onset 
of color change) and were harvested at breaker plus 7 days. Data were taken 
using 15-20 fruit from each line. Each type of texture measurement was done 
twice for each fruit and fruit were subjected to 4 types of texture 
measurements using a Stable Micro Systems TA-XT2i texture analyzer. The 4 
measurements were: (1) 2-inch flat plate compression to 3 mm (Figure 11A); 
(2) 4 mm spherical indenter compression to 3 mm (Figure 11B); (3) 4 mm 
cylindrical indenter compression to 3 mm (Figure 11C); and (4) 4 mm 
cylindrical indenter puncture to 10 mm (Figure 11D). The summary of these data 
is shown in Figure 11E(1-4). In Figures 11A-E(1-4) line 1 was the parental line 
and lines 2-8 each represent an independent transformant containing one T-DNA 
copy of the TBG4 antisense construct. Statistical analysis (Duncan's and 
Scheffé's tests) of the data revealed that fruit from the transformed lines 3, 7 
and 8 were not significantly different from the parental line but that 
transformed lines 2, 4, 5 and 6 were significantly firmer than the parental 
fruit. Most noteworthy is that fruit from transformed line 2 had a mean 
firmness 40% greater than that of the parental line (Figures 11A-D). 

Northern Blot Analysis 

We are currently investigating any changes in the biochemical 
composition of fruit where TBG4 mRNA levels have been suppressed. These 
experiments are designed to show a link between increased fruit firmness and 
TBG4 mRNA suppression, TBG4 encoded enzyme activity suppression, 
possible cell wall modification (e.g. increased galactosyl residue content) and a 
decrease in free galactose levels during fruit ripening. 

These experiments are not complete; however, some preliminary 
Northern blot experiments were done and the data are shown in Figure 12A,B. 
There is no parental or azygous control fruit RNA shown in Figure 12A,B 
because these plants were the last to grow, and RNA extractions are just being 
done now. As a comparison of normal fruit TBG4 mRNA levels refer to 
Figure 5 above. The data from Figure 5 showed that TBG4 mRNA levels are 
low at the mature green stage, peak at the turning stage and are reduced at the 
red stage. All the lines except for 2 and 3 expressed antisense TBG4 mRNA 
(Figure 12A,B). The antisense transcripts appear as two bands, smaller in 
length than the endogenous mRNA. The two bands probably resulted from 1, 
the expected transcriptional stop signal provided by the NOS-terminator and 2, 
a cryptic transcriptional stop signal in the antisense TBG4 cDNA. The most 
notable result was in line 2 where no TBG4 mRNA was detected at the turning 
stage. Line 2 also had the firmest red fruit (see Figures 11A-D). The absence of 
detectable TBG4 mRNA probably was the result of cosuppression of both the 
endogenous and antisense mRNAs. When compared to earlier blots (e.g. 
Figure 4), all of the lines appeared to have an overall reduced level of TBG4 
mRNA, but it is impossible to assign numbers to this statement without the 
parental and azygous control RNA on the same Northern blot. 

The specification discloses that β-galactosidase II polypeptide is involved 
in the degradation of cell wall pectin during fruit ripening. The present 
invention shows the role of β-galactosidases in tomato during fruit ripening 
and softening and describes the cloning of a β-galactosidase cDNA clone 
that codes for a β(1→4)galactan degrading enzyme, which is expressed in 
ripening tomato fruit tissues. 

The present work indicates that pZBG2-l-4 is a cDNA derived from the 
transcript of the TBG4 gene which codes for β-galactosidase II for the 
following reasons: 

First, the deduced amino acid sequence of the highly conserved 
amino-terminal portion of the expected mature pZBG2-l-4 translation product 
matches almost exactly (28 of 30 amino acids) with the amino-terminal 
sequence of β-galactosidase II as purified by Carey et al. (1995) and 
designated TOMAA. Importantly, the two amino acids (KY) in the 
β-galactosidase II sequence (TOMAA) that do not match the pZBG2-l-4 
deduced amino acid sequence of the present invention are believed to be 
incorrect since all plant β-galactosidase sequences in the database and four 
additional β-galactosidase-related cDNAs that were identified from tomato all 
match or have conserved substitutions with the deduced amino acid sequence 
of pZBG2-l-4 at these same two amino acid (ST) positions (Figure 3). 

Second, the transcript detected by pZBG2-l-4 is present in normal 
ripening fruit at the same time that β-galactosidase II activity was detected 
(Figure 5; Carey et al., 1995). Moreover, little or no transcript was detected 
in fruit at 45 and 50 dpa from the mutants nor, rin and Nr (Figure 7). This 
observation also coincides with the data presented by Carey et al. (1995) that 
β-galactosidase II activity remained at levels equal to mature green fruit and 
did not rise in fruit 45-65 dpa from nor or rin plants. Interestingly, Carrington 
and Pressey (1996) have reported that β-galactosidase II activity was only 
detected in 'Rutgers' fruit after the turning stage of ripeness. The Northern 
data in the present invention indicate that maximum β-galactosidase II 
activity occurs only after the turning stage, assuming mRNA levels predict 
extractable enzyme activity (Figure 5). 

Third, the apparent molecular weight of 77.9 kD and pI of 8.9 for the 
mature protein predicted from the pZBG2-l-4 sequence is similar to that 
determined for β-galactosidase II. Pressey (1983) estimated a molecular 
weight of 62 kD by gel-filtration column chromatography and a pI of 7.8 by 
isoelectric focusing, while Carey et al. (1995) estimated a molecular weight of 
75 kD by SDS-PAGE and a pI of 9.8 by isoelectric focusing. 
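
A molecular weight "predicted from the sequence" is conventionally obtained by summing average residue masses and adding one water. The sketch below shows that calculation and a simple comparison of the predicted 77.9 kD with the two measured values quoted above. The residue masses are standard average values rather than figures from this specification, the function names are hypothetical, and the software actually used by the inventors may differ in detail.

```python
# Average amino acid residue masses in daltons (standard values, an assumption
# in the sense that the source does not list them).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain accounts for the free termini


def molecular_weight_kd(protein: str) -> float:
    """Predicted average molecular weight of a polypeptide, in kilodaltons."""
    seq = protein.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if not seq or unknown:
        raise ValueError(f"empty sequence or unknown residues: {sorted(unknown)}")
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


def percent_difference(predicted: float, measured: float) -> float:
    """Signed difference of predicted from measured, as a percent of measured."""
    return 100.0 * (predicted - measured) / measured


if __name__ == "__main__":
    for method, measured in (("SDS-PAGE", 75.0), ("gel filtration", 62.0)):
        print(method, round(percent_difference(77.9, measured), 1), "%")
```

The predicted value lies within about 4% of the SDS-PAGE estimate and about 26% above the gel-filtration estimate.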

Fourth, enzyme produced from the pZBG2-l-4 ORF using a heterologous 
yeast expression system has both β-galactosidase activity and exo-galactanase 
activity. 

What we claim is: 

1. An isolated nucleic acid molecule comprising a polynucleotide 
having a nucleotide sequence at least 95% identical to a sequence selected 
from the group consisting of: 

(a) a nucleotide sequence encoding the β-galactosidase II polypeptide 
having the complete amino acid sequence selected from the group consisting 
of SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15 and SEQ ID NO: 16 
and designated TBG1, TBG2, TBG3, TBG4, TBG5, TBG6 and TBG7, 
respectively, as shown in Figure 2 or as encoded by the cDNA clone selected 
from the group consisting of cDNA clones contained in GenBank Accession 
No. AF023847, AF154420, AF154421, AF020390, AF154423, AF154424 and 
AF154422; 

(b) a nucleotide sequence encoding the mature β-galactosidase II 
polypeptide having the amino acid sequence at about positions 24-724 selected 
from the group consisting of SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, TBG4, 
TBG5, TBG6 and TBG7, respectively, as shown in Figure 2 or as encoded by 
the cDNA clone selected from the group consisting of cDNA clones contained 
in GenBank Accession No. AF023847, AF154420, AF154421, AF020390, 
AF154423, AF154424 and AF154422; and 

(c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b), above. 

2. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID NO: 7 as shown in Figure 2. 

3. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 (SEQ ID NO: 4) 
encoding the β-galactosidase II polypeptide having the amino acid sequence 
designated TBG4 in Figure 2. 

4. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 (SEQ ID NO:4) 
encoding the mature polypeptide having the amino acid sequence from about 
24 to about 724 in the amino acid sequence designated TBG4 in Figure 2. 

5. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF023847. 

6. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF154420. 

7. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF154421. 

8. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF020390. 

9. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF154423. 

10. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF154424. 

11. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in GenBank Accession No. AF154422. 

12. An isolated nucleic acid molecule comprising a polynucleotide 
which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), or 
(c) of claim 1 wherein said polynucleotide which hybridizes does not hybridize 
under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. 

13. An isolated nucleic acid molecule comprising a polynucleotide 
which encodes the amino acid sequence of an epitope-bearing portion of a 
β-galactosidase II polypeptide having an amino acid sequence in (a), (b), or (c) 
of claim 1. 

14. A method for making a recombinant vector comprising 
inserting an isolated nucleic acid molecule of claim 1 into a vector. 

15. A recombinant vector produced by the method of claim 14. 

16. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 15 into a host cell. 

17. A recombinant host cell produced by the method of claim 16. 

18. A recombinant method for producing β-galactosidase II 
polypeptide, comprising culturing the recombinant host cell of claim 17 under 
conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

19. An isolated β-galactosidase II polypeptide comprising an amino 
acid sequence at least 95% identical to a sequence selected from the group 
consisting of: 

a) amino acid sequence at about positions 24-724 selected from the 
group consisting of sequences SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 
10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, TBG4, 
TBG5, TBG6 and TBG7, respectively as shown in Figure 2; and 

b) amino acid sequence as encoded by the cDNA clone selected from 
the group consisting of cDNA clones contained in GenBank Accession No. 
AF023847, AF154420, AF154421, AF020390, AF154423, AF154424 and 
AF154422. 

20. An isolated polypeptide comprising an epitope-bearing portion 
of the β-galactosidase II protein. 

21. An isolated antibody that binds specifically to a β-galactosidase 
II polypeptide of claim 20. 

22. An isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding the β-galactosidase II polypeptide 
having the complete amino acid sequence selected from the group consisting 
of SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15 and SEQ ID NO: 16 
and designated TBG1, TBG2, TBG3, TBG4, TBG5, TBG6 and TBG7, 
respectively as shown in Figure 3 or as encoded by the cDNA clone selected 
from the group consisting of cDNA clones contained in GenBank Accession 
No. AF023847, AF154420, AF154421, AF020390, AF154423, AF154424 and 
AF154422; 

(b) a nucleotide sequence encoding the mature β-galactosidase II 
polypeptide having the amino acid sequence at about positions 24-724 selected 
from the group consisting of sequences SEQ ID NO: 8, SEQ ID NO: 9, SEQ 
ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 
14, SEQ ID NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, 
TBG4, TBG5, TBG6 and TBG7, respectively as shown in Figure 3 or as 
encoded by the cDNA clone selected from the group consisting of cDNA 
clones contained in GenBank Accession No. AF023847, AF154420, 
AF154421, AF020390, AF154423, AF154424 and AF154422; and 

(c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b), above. 



23. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has a complete nucleotide sequence in Figure 2 selected from 
the group consisting of SEQ ID NOs: 1-3 and 5-7. 

24. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has a nucleotide sequence in Figure 2 selected from the group 
consisting of SEQ ID NOs: 1-3 and 5-7 encoding the β-galactosidase 
polypeptide having the complete amino acid sequence designated TBG1-3 and 
5-7, respectively. 



25. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 selected from the 
group consisting of SEQ ID NOs: 1-3 and 5-7 encoding the mature polypeptide 
having the amino acid sequence designated TBG1-3 and 5-7, respectively. 

26. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in a GenBank Accession No. selected from the group consisting of 
AF023847, AF154420, AF154421, AF020390, AF154423, AF154424 and 
AF154422. 

27. A method of modifying cell wall metabolism in a plant which 
comprises transforming said plant with a DNA construct adapted to modify the 
activity of a β-galactosidase, growing said plant or its descendant and selecting 
a plant having modified cell wall characteristics, said construct comprising a 
transcriptional initiation region operative in plants operably linked to a DNA 
sequence encoding at least one β-galactosidase. 

28. A method as claimed in claim 27, wherein said DNA sequence 
is selected from the group consisting of the sequences of nucleic acid 
molecules claimed in claim 1 or claim 22. 

29. A plant cell transformed with a nucleic acid molecule as 
claimed in claim 1 or claim 22. 



30. A plant derived from a plant cell as claimed in claim 29. 

31. A plant seed derived from a plant as claimed in claim 30. 

32. A method for modifying β-galactosidase gene expression in a 
plant comprising transforming said plant with a nucleic acid molecule as 
claimed in claim 1 or claim 22, growing the transformed plant and selecting a 
plant having modified β-galactosidase gene expression when compared with 
an untransformed plant. 

Figure 1. β-Galactosidase phylogenetic tree based on shared amino acid 
sequence identity. A. Tomato β-galactosidase (TBG) cDNAs. B. Plant 
β-galactosidases. Higgins-Sharp algorithm (UPGMA method). 
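
The Figure 1 caption names two ingredients: pairwise amino acid sequence identity and UPGMA clustering. The following is a minimal sketch of how those two steps fit together, not the procedure actually used to draw Figure 1. It assumes the sequences have already been aligned to equal length with "-" as the gap character, and it omits the pairwise alignment stage of the Higgins-Sharp method. The function names and the toy sequences X, Y and Z are hypothetical and are not taken from the TBG sequences.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def upgma(seqs):
    """Cluster pre-aligned sequences by UPGMA; returns a nested tuple tree.

    seqs maps a sequence name to its aligned sequence. Distance is taken
    here as 100 minus percent identity (an assumption of this sketch).
    """
    sizes = {name: 1 for name in seqs}  # cluster -> number of leaves
    dist = {
        frozenset((a, b)): 100.0 - percent_identity(seqs[a], seqs[b])
        for a, b in combinations(seqs, 2)
    }
    while len(sizes) > 1:
        # Join the two closest clusters.
        a, b = min(combinations(sizes, 2), key=lambda p: dist[frozenset(p)])
        na, nb = sizes.pop(a), sizes.pop(b)
        merged = (a, b)
        # Distance to every other cluster is the size-weighted average.
        for c in sizes:
            dist[frozenset((merged, c))] = (
                na * dist[frozenset((a, c))] + nb * dist[frozenset((b, c))]
            ) / (na + nb)
        sizes[merged] = na + nb
    return next(iter(sizes))


# Hypothetical toy alignment, for illustration only.
toy = {"X": "MKVLA", "Y": "MKVLG", "Z": "MRTLG"}
tree = upgma(toy)
```

In the toy input, X and Y share four of five residues, so they are joined first and Z attaches to that pair.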

Figure 2 

Gene/clone name: TBG1/pZBG2-1-10; accession number AF023847; Sequence ID number 1 

31 AGCCTAGAAGAAGGAAAAAAAGAAGTA^^ 122 

123 AAAACAGCTGTCTTCCCTTC 214 

215 GAATAATAAGCTGTGGGGGTAGGSAGGAAGT^ 305 

306 ATG GGT TTT TGG ATG GCA ATG TTG CTG ATG TTG TTA TTG TGT TTA TGG GTT TCT TGT GGA ATT GCT TCT 374 
1 Met Gly Phe Trp Met Ala Met Leu Leu Met Leu Leu Leu Cys Leu Trp Val Ser Cys Gly Ile Ala Ser 23 

375 GTT TCA TAT GAC CAT AAA GCT ATC ATT GTA AAT GGA CAA AGA AAA ATT CTC ATT TCT GGA TCC ATT CAC 443 
24 Val Ser Tyr Asp His Lys Ala Ile Ile Val Asn Gly Gln Arg Lys Ile Leu Ile Ser Gly Ser Ile His 46 

444 TAC CCT AGA AGC ACC CCT GAG ATG TGG CCA GAT CTT ATT CAG AAG GCA AAA GAA GGG GGA GTT GAT GTT 512 
47 Tyr Pro Arg Ser Thr Pro Glu Met Trp Pro Asp Leu Ile Gln Lys Ala Lys Glu Gly Gly Val Asp Val 69 

513 ATA CAG ACT TAT GTT TTC TGG AAT GGG CAT GAG CCT GAA GAA GGG AAA TAT TAT TTT GAA GAG AGG TAT 581 
70 Ile Gln Thr Tyr Val Phe Trp Asn Gly His Glu Pro Glu Glu Gly Lys Tyr Tyr Phe Glu Glu Arg Tyr 92 

582 GAT TTA GTG AAG TTC ATT AAA GTG GTG CAA GAA GCA GGA CTT TAT GTG CAT CTT AGG ATT GGA CCT TAT 650 
93 Asp Leu Val Lys Phe Ile Lys Val Val Gln Glu Ala Gly Leu Tyr Val His Leu Arg Ile Gly Pro Tyr 115 

651 GCA TGT GCT GAA TGG AAT TTT GGG GGT TTT CCT GTT TGG CTG AAG TAT GTT CCA GGT ATT AGT TTC AGA 719 
116 Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly Ile Ser Phe Arg 138 

720 ACA AAC AAT GAG CCA TTC AAG GCT GCA ATG CAA AAG TTC ACT ACT AAG ATT GTT GAT ATG ATG AAA GCA 788 
139 Thr Asn Asn Glu Pro Phe Lys Ala Ala Met Gln Lys Phe Thr Thr Lys Ile Val Asp Met Met Lys Ala 161 

789 GAA AAG CTC TAT GAA ACT CAG GGT GGT CCA ATT ATT CTA TCT CAG ATA GAA AAT GAA TAT GGA CCT ATG 857 
162 Glu Lys Leu Tyr Glu Thr Gln Gly Gly Pro Ile Ile Leu Ser Gln Ile Glu Asn Glu Tyr Gly Pro Met 184 

858 GAG TGG GAA CTA GGT GAA CCT GGT AAA GTT TAC TCA GAA TGG GCA GCC AAA ATG GCT GTG GAT CTT GGC 926 
185 Glu Trp Glu Leu Gly Glu Pro Gly Lys Val Tyr Ser Glu Trp Ala Ala Lys Met Ala Val Asp Leu Gly 207 

927 ACT GGT GTC CCA TGG ATC ATG TGC AAG CAA GAT GAT GTC CCT GAT CCT ATT ATT AAT ACT TGC AAT GGT 995 
208 Thr Gly Val Pro Trp Ile Met Cys Lys Gln Asp Asp Val Pro Asp Pro Ile Ile Asn Thr Cys Asn Gly 230 

996 TTC TAC TGT GAC TAC TTC ACA CCA AAT AAG GCT AAT AAA CCC AAG ATG TGG ACT GAA GCC TGG ACA GCC 1064 
231 Phe Tyr Cys Asp Tyr Phe Thr Pro Asn Lys Ala Asn Lys Pro Lys Met Trp Thr Glu Ala Trp Thr Ala 253 

1065 TGG TTT ACC GAA TTT GGA GGT CCA GTT CCT TAC CGT CCT GCA GAG GAT ATG GCA TTT GCT GTC GCA AGA 1133 
254 Trp Phe Thr Glu Phe Gly Gly Pro Val Pro Tyr Arg Pro Ala Glu Asp Met Ala Phe Ala Val Ala Arg 276 

1134 TTT ATA CAA ACG GGA GGC TCC TTC ATC AAT TAC TAT ATG TAT CAT GGA GGA ACA AAC TTT GGA AGG ACT 1202 
277 Phe Ile Gln Thr Gly Gly Ser Phe Ile Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr 299 

1203 TCT GGT GGC CCA TTT ATT GCT ACT AGT TAT GAT TAT GAT GCA CCC CTA GAT GAA TTT GGG TCA TTA CGG 1271 
300 Ser Gly Gly Pro Phe Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Phe Gly Ser Leu Arg 322 

1272 CAG CCT AAA TGG GGT CAT CTG AAA GAT CTA CAT AGA GCA ATA AAG CTC TGT GAG CCA GCT TTA GTA TCT 1340 
323 Gln Pro Lys Trp Gly His Leu Lys Asp Leu His Arg Ala Ile Lys Leu Cys Glu Pro Ala Leu Val Ser 345 

1341 GTA GAT CCA ACT GTG ACA TCC TTA GGA AAC TAT CAA GAG GCA CGT GTT TTC AAG TCA GAG TCT GGG GCC 1409 
346 Val Asp Pro Thr Val Thr Ser Leu Gly Asn Tyr Gln Glu Ala Arg Val Phe Lys Ser Glu Ser Gly Ala 368 

1410 TGC GCT GCC TTC CTA GCA AAT TAC AAC CAG CAC TCT TTT GCT AAA GTG GCA TTT GGG AAC ATG CAT TAT 1478 
369 Cys Ala Ala Phe Leu Ala Asn Tyr Asn Gln His Ser Phe Ala Lys Val Ala Phe Gly Asn Met His Tyr 391 

1479 AAC TTG CCA CCC TGG TCT ATC AGC ATT CTT CCC GAC TGC AAG AAC ACT GTC TAT AAT ACT GCA AGG GTT 1547 
392 Asn Leu Pro Pro Trp Ser Ile Ser Ile Leu Pro Asp Cys Lys Asn Thr Val Tyr Asn Thr Ala Arg Val 414 

1548 GGT GCT CAA AGT GCT CAG ATG AAG ATG ACT CCA GTC AGT AGA GGA TTC TCA TGG GAG TCA TTC AAT GAA 1616 
415 Gly Ala Gln Ser Ala Gln Met Lys Met Thr Pro Val Ser Arg Gly Phe Ser Trp Glu Ser Phe Asn Glu 437 



WO 99/64564 PCT/US99/12697 



Figure 2 
Sheet 2 of 12 

Gene/clone name: TBG1/pZBG2-1-10; accession number AF023847; Sequence ID number 1 cont. 

1617 GAC GCA GCA TCG CAT GAA GAC GAC ACT TTC ACA GTT GTT GGG TTA TTG GAG CAG ATT AAT ATC ACA AGA 1685 
438 Asp Ala Ala Ser His Glu Asp Asp Thr Phe Thr Val Val Gly Leu Leu Glu Gln Ile Asn Ile Thr Arg 460 

1686 GAT GTA TCT GAT TAC TTG TGG TAT ATG ACT GAC ATT GAG ATT GAT CCA ACA GAA GGA TTT TTG AAT AGT 1754 
461 Asp Val Ser Asp Tyr Leu Trp Tyr Met Thr Asp Ile Glu Ile Asp Pro Thr Glu Gly Phe Leu Asn Ser 483 

1755 GGA AAT TGG CCT TGG CTT ACT GTC TTT TCT GCT GGC CAT GCA TTG CAT GTA TTC GTG AAT GGT CAA TTA 1823 
484 Gly Asn Trp Pro Trp Leu Thr Val Phe Ser Ala Gly His Ala Leu His Val Phe Val Asn Gly Gln Leu 506 

1824 GCA GGA ACT GTG TAC GGA AGT TTA GAA AAC CCA AAA CTA ACT TTC AGC AAC GGT ATA AAT CTG AGA GCT 1892 
507 Ala Gly Thr Val Tyr Gly Ser Leu Glu Asn Pro Lys Leu Thr Phe Ser Asn Gly Ile Asn Leu Arg Ala 529 

1893 GGT GTG AAC AAG ATT TCT CTG CTA AGC ATT GCT GTT GGT CTT CCG AAC GTT GGC CCT CAT TTT GAG ACA 1961 
530 Gly Val Asn Lys Ile Ser Leu Leu Ser Ile Ala Val Gly Leu Pro Asn Val Gly Pro His Phe Glu Thr 552 

1962 TGG AAT GCT GGT GTT CTT GGA CCA GTT TCA CTT AAT GGA CTT AAT GAA GGA ACA AGA GAT TTA ACA TGG 2030 
553 Trp Asn Ala Gly Val Leu Gly Pro Val Ser Leu Asn Gly Leu Asn Glu Gly Thr Arg Asp Leu Thr Trp 575 

2031 CAG AAA TGG TTC TAC AAG GTT GGT CTA AAA GGA GAA GCC CTG AGT CTT CAT TCA CTC AGT GGT AGC CCA 2099 
576 Gln Lys Trp Phe Tyr Lys Val Gly Leu Lys Gly Glu Ala Leu Ser Leu His Ser Leu Ser Gly Ser Pro 598 

2100 TCC GTG GAG TGG GTG GAA GGC TCT TTA GTG GCT CAG AAG CAG CCA CTC AGT TGG TAT AAG ACT ACA TTC 2168 
599 Ser Val Glu Trp Val Glu Gly Ser Leu Val Ala Gln Lys Gln Pro Leu Ser Trp Tyr Lys Thr Thr Phe 621 

2169 AAT GCT CCA GAT GGA AAT GAA CCT TTG GCT TTA GAT ATG AAT ACC ATG GGC AAA GGT CAA GTA TGG ATA 2237 
622 Asn Ala Pro Asp Gly Asn Glu Pro Leu Ala Leu Asp Met Asn Thr Met Gly Lys Gly Gln Val Trp Ile 644 

2238 AAT GGT CAG AGC CTC GGA CGC CAC TGG CCT GCA TAT AAA TCA TCT GGA AGT TGT AGT GTC TGT AAC TAT 2306 
645 Asn Gly Gln Ser Leu Gly Arg His Trp Pro Ala Tyr Lys Ser Ser Gly Ser Cys Ser Val Cys Asn Tyr 667 

2307 ACT GGC TGG TTT GAT GAG AAA AAG TGC CTA ACT AAC TGT GGT GAG GGC TCA CAA AGA TGG TAC CAC GTA 2375 
668 Thr Gly Trp Phe Asp Glu Lys Lys Cys Leu Thr Asn Cys Gly Glu Gly Ser Gln Arg Trp Tyr His Val 690 

2376 CCC CGG TCT TGG CTG TAT CCT ACT GGA AAT TTG TTA GTT GTA TTC GAG GAA TGG GGA GGA GAT CCT TAT 2444 
691 Pro Arg Ser Trp Leu Tyr Pro Thr Gly Asn Leu Leu Val Val Phe Glu Glu Trp Gly Gly Asp Pro Tyr 713 

2445 GGA ATC ACT TTA GTC AAA AGA GAA ATA GGG AGT GTT TGT GCT GAT ATA TAT GAG TGG CAA CCA CAG TTA 2513 
714 Gly Ile Thr Leu Val Lys Arg Glu Ile Gly Ser Val Cys Ala Asp Ile Tyr Glu Trp Gln Pro Gln Leu 736 

2514 TTG AAT TGG CAG AGG CTA GTA TCT GGT AAG TTT GAC AGA CCT CTC AGA CCT AAA GCC CAT CTT AAG TGT 2582 
737 Leu Asn Trp Gln Arg Leu Val Ser Gly Lys Phe Asp Arg Pro Leu Arg Pro Lys Ala His Leu Lys Cys 759 

2583 GCA CCT GGT CAG AAG ATT TCT TCA ATC AAA TTT GCA AGC TTT GGA ACA CCA GAG GGA GTT TGT GGG AAC 2651 
760 Ala Pro Gly Gln Lys Ile Ser Ser Ile Lys Phe Ala Ser Phe Gly Thr Pro Glu Gly Val Cys Gly Asn 782 

2652 TTC CAG CAG GGA AGC TGC CAT GCT CCG CGC TCA TAT GAT GCT TTC AAA AAG AAT TGT GTT GGG AAA GAG 2720 
783 Phe Gln Gln Gly Ser Cys His Ala Pro Arg Ser Tyr Asp Ala Phe Lys Lys Asn Cys Val Gly Lys Glu 805 

2721 TCT TGC TCA GTA CAG GTA ACA CCA GAG AAT TTT GGA GGT GAT CCA TGT CGA AAC GTT CTA AAG AAA CTC 2789 
806 Ser Cys Ser Val Gln Val Thr Pro Glu Asn Phe Gly Gly Asp Pro Cys Arg Asn Val Leu Lys Lys Leu 828 

2790 TCA GTG GAA GCC ATT TGT AGT TGA TGATTCTGAGTATACAAGTGAAAAAATACTTGAA 2873 
829 Ser Val Glu Ala Ile Cys Ser *** 836 

2874 AGCTACTAGACATCCATTAACCCACACTACC^ 2965 

2966 CHCACOTGATTATGAAGATGATTGACT 3057 

3058 GTGCAAATTTTGTTTGTGTTAGGGT 3149 

3150 AAGCAATAATTCATTGCTTTGCACATT^ 3224 
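
Each coding row of Figure 2 pairs a line of nucleotide triplets with the amino acids they encode. The sketch below shows how such a row can be checked against the standard genetic code, which is useful for spotting scanning errors in either line. It is illustrative only: the helper names are hypothetical, and the 23 codons used are the first coding row of the TBG1 listing (nucleotides 306-374) with evident scanning errors corrected against the printed translation.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter codes; "***" marks a stop codon as in Figure 2.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def translate_row(codons):
    """Translate a space-separated row of DNA codons to three-letter codes."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons.upper().split())


# First coding row of TBG1 (nucleotides 306-374), scanning errors corrected.
tbg1_row = ("ATG GGT TTT TGG ATG GCA ATG TTG CTG ATG TTG TTA TTG TGT "
            "TTA TGG GTT TCT TGT GGA ATT GCT TCT")
tbg1_translation = translate_row(tbg1_row)
```

A codon that does not translate to the residue printed beneath it, or a token that is not a valid triplet (which raises `KeyError` here), points to a misread character in that row.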






Figure 2 
Sheet 3 of 12 

Gene/clone name: TBG2/pZBG2-1-12; accession number AF154420; Sequence ID number 2 

1 GG 2 

3 AGC AGA AGA AAA ACA CTG AAT TTT CCG TTA ATA CTA ACG GTG TTA ACT ATC CAC TTT GTG ATC GTC GCC 71 
1 Ser Arg Arg Lys Thr Leu Asn Phe Pro Leu Ile Leu Thr Val Leu Thr Ile His Phe Val Ile Val Ala 23 

72 GGC GAG TAT TTC AAG CCG TTC AAT GTC ACC TAC GAT AAC CGA GCT CTC ATC ATC GGC GGT AAA CGC CGT 140 
24 Gly Glu Tyr Phe Lys Pro Phe Asn Val Thr Tyr Asp Asn Arg Ala Leu Ile Ile Gly Gly Lys Arg Arg 46 

141 ATG CTT ATC TCC GCC GGA ATT CAC TAC CCT CGC GCC ACT CCT GAG ATG TGG CCC ACA TTG ATA GCT AGG 209 
47 Met Leu Ile Ser Ala Gly Ile His Tyr Pro Arg Ala Thr Pro Glu Met Trp Pro Thr Leu Ile Ala Arg 69 

210 AGC AAA GAA GGT GGT GCA GAT GTC ATC GAG ACT TAT ACA TTT TGG AAT GGT CAT GAG CCA ACC AGG GGA 278 
70 Ser Lys Glu Gly Gly Ala Asp Val Ile Glu Thr Tyr Thr Phe Trp Asn Gly His Glu Pro Thr Arg Gly 92 

279 CAG TAC AAT TTT GAA GGA AGA TAT GAT ATT GTC AAG TTC GCA AAG CTA GTC GGA TCT CAT GGA CTG TTC 347 
93 Gln Tyr Asn Phe Glu Gly Arg Tyr Asp Ile Val Lys Phe Ala Lys Leu Val Gly Ser His Gly Leu Phe 115 

348 CTC TTT ATT CGA ATA GGT CCT TAT GCC TGT GCA GAA TGG AAC TTC GGG GGA TTC CCC ATA TGG CTT CGT 416 
116 Leu Phe Ile Arg Ile Gly Pro Tyr Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Ile Trp Leu Arg 138 

417 GAT ATA CCT GGA ATA GAA TTT CGA ACA GAT AAT GCA CCA TTC AAG GAG GAG ATG GAG CGC TAT GTT AAA 485 
139 Asp Ile Pro Gly Ile Glu Phe Arg Thr Asp Asn Ala Pro Phe Lys Glu Glu Met Glu Arg Tyr Val Lys 161 

486 AAG ATA GTT GAT CTT ATG ATA TCT GAG TCG CTC TTT TCG TGG CAA GGT GGT CCT ATC ATT TTG CTG CAG 554 
162 Lys Ile Val Asp Leu Met Ile Ser Glu Ser Leu Phe Ser Trp Gln Gly Gly Pro Ile Ile Leu Leu Gln 184 

555 ATT GAA AAT GAA TAT GGA AAT GTT GAA AGC TCA TTC GGT CCC AAG GGG AAG TTA TAT ATG AAA TGG GCT 623 
185 Ile Glu Asn Glu Tyr Gly Asn Val Glu Ser Ser Phe Gly Pro Lys Gly Lys Leu Tyr Met Lys Trp Ala 207 

624 GCT GAA ATG GCT GTT GGT CTT GGT GCT GGT GTT CCA TGG GTC ATG TGC AGG CAA ACT GAT GCT CCA GAA 692 
208 Ala Glu Met Ala Val Gly Leu Gly Ala Gly Val Pro Trp Val Met Cys Arg Gln Thr Asp Ala Pro Glu 230 

693 TAC ATC ATA GAT ACT TGT AAT GCA TAC TAT TGT GAT GGG TTC ACG CCG AAT TCC GAG AAG AAA CCG AAA 761 
231 Tyr Ile Ile Asp Thr Cys Asn Ala Tyr Tyr Cys Asp Gly Phe Thr Pro Asn Ser Glu Lys Lys Pro Lys 253 

762 ATT TGG ACT GAG AAT TGG AAT GGA TGG TTT GCA GAT TGG GGT GAA AGA CTT CCA TAT AGA CCT TCC GAG 830 
254 Ile Trp Thr Glu Asn Trp Asn Gly Trp Phe Ala Asp Trp Gly Glu Arg Leu Pro Tyr Arg Pro Ser Glu 276 

831 GAT ATT GCA TTT GCA ATT GCT CGT TTC TTT CAA CGT GGG GGC AGC TTA CAG AAC TAT TAT ATG TAT TTT 899 
277 Asp Ile Ala Phe Ala Ile Ala Arg Phe Phe Gln Arg Gly Gly Ser Leu Gln Asn Tyr Tyr Met Tyr Phe 299 

900 GGT GGG ACA AAT TTT GGC CGG ACT GCT GGT GGC CCA ACT CAA ATC ACT AGC TAT GAT TAT GAT GCT CCA 968 
300 Gly Gly Thr Asn Phe Gly Arg Thr Ala Gly Gly Pro Thr Gln Ile Thr Ser Tyr Asp Tyr Asp Ala Pro 322 

969 CTG GAT GAA TAT GGA CTA CTA CGT CAA CCT AAA TGG GGC CAT TTG AAG GAT CTG CAT GCT GCT ATA AAG 1037 
323 Leu Asp Glu Tyr Gly Leu Leu Arg Gln Pro Lys Trp Gly His Leu Lys Asp Leu His Ala Ala Ile Lys 345 

1038 CTT TGT GAA CCA GCT CTT GTT GCT GCT GAT TCA CCT CAG TAT ATT AAA CTG GGA CCA AAA CAG GAG GCA 1106 
346 Leu Cys Glu Pro Ala Leu Val Ala Ala Asp Ser Pro Gln Tyr Ile Lys Leu Gly Pro Lys Gln Glu Ala 368 

1107 CAT GTC TAT CGT GGA ACA TCC AAC AAC ATT GGC CAA TAT ATG TCC TTA AAT GAA GGC ATA TGC GCA GCA 1175 
369 His Val Tyr Arg Gly Thr Ser Asn Asn Ile Gly Gln Tyr Met Ser Leu Asn Glu Gly Ile Cys Ala Ala 391 

1176 TTT ATT GCA AAT ATT GAT GAA CAT GAA TCA GCA ACA GTG AAA TTT TAC GGT CAA GAG TTC ACT TTA CCT 1244 
392 Phe Ile Ala Asn Ile Asp Glu His Glu Ser Ala Thr Val Lys Phe Tyr Gly Gln Glu Phe Thr Leu Pro 414 

1245 CCA TGG TCA GTG GTA TTC TGC CAG ATT GCA GAA ATA CAG CTT TCA ACA CAG CTA AGG TGG GGG CAC AAA 1313 
415 Pro Trp Ser Val Val Phe Cys Gln Ile Ala Glu Ile Gln Leu Ser Thr Gln Leu Arg Trp Gly His Lys 437 

1314 CTT CAA TCA AAA CAG TGG GCT CAG ATT CTG TTT CAG TTG GGA ATA ATT CTT TGT TTC TAC AAG TTA TCA 1382 
438 Leu Gln Ser Lys Gln Trp Ala Gln Ile Leu Phe Gln Leu Gly Ile Ile Leu Cys Phe Tyr Lys Leu Ser 460 

1383 CTA AAA GCA AGC TCG GAA AGT TTT TCA CAA TCT TGG ATG ACA TTG AAG GAG CCA CTT GGT GTG TGG GGT 1451 
461 Leu Lys Ala Ser Ser Glu Ser Phe Ser Gln Ser Trp Met Thr Leu Lys Glu Pro Leu Gly Val Trp Gly 483 

1452 GAC AAG AAT TTC ACT TCT AAA GGA ATA CTG GAG CAT CTG AAT GTG ACA AAA GAC CAG TCT GAT TAC CTG 1520 
484 Asp Lys Asn Phe Thr Ser Lys Gly Ile Leu Glu His Leu Asn Val Thr Lys Asp Gln Ser Asp Tyr Leu 506 

1521 TGG TAT CTG ACC AGG ATA TAT ATT TCT GAT GAT GAC ATC TCA TTT TGG GAG GAA AAT GAT GTT AGT CCA 1589 
507 Trp Tyr Leu Thr Arg Ile Tyr Ile Ser Asp Asp Asp Ile Ser Phe Trp Glu Glu Asn Asp Val Ser Pro 529 

1590 ACA ATT GAT ATT GAT AGC ATG CGT GAT TTT GTT CGC ATT TTT GTT AAT GGG CAG CTT GCA GGT AGT GTG 1658 
530 Thr Ile Asp Ile Asp Ser Met Arg Asp Phe Val Arg Ile Phe Val Asn Gly Gln Leu Ala Gly Ser Val 552 

1659 AAA GGC AAA TGG ATC AAG GTG GTT CAA CCT GTT AAG CTG GTT CAG GGA TAC AAC GAC ATA CTG CTA TTA 1727 
553 Lys Gly Lys Trp Ile Lys Val Val Gln Pro Val Lys Leu Val Gln Gly Tyr Asn Asp Ile Leu Leu Leu 575 

1728 TCT GAG ACG GTG GGA TTG CAG AAT TAT GGT GCC TTC TTG GAG AAG GAT GGG GCA GGT TTT AAA GGT CAG 1796 
576 Ser Glu Thr Val Gly Leu Gln Asn Tyr Gly Ala Phe Leu Glu Lys Asp Gly Ala Gly Phe Lys Gly Gln 598 

1797 ATA AAG CTT ACA GGA TGC AAA AGC GGG GAT ATC AAT CTC ACA ACA TCT TTA TGG ACC TAC CAG GTG GGG 1865 
599 Ile Lys Leu Thr Gly Cys Lys Ser Gly Asp Ile Asn Leu Thr Thr Ser Leu Trp Thr Tyr Gln Val Gly 621 

1866 CTT AGA GGC GAA TTC CTG GAA GTA TAT GAT GTC AAT AGT ACT GAA AGT GCA GGA TGG ACT GAG TTT CCC 1934 
622 Leu Arg Gly Glu Phe Leu Glu Val Tyr Asp Val Asn Ser Thr Glu Ser Ala Gly Trp Thr Glu Phe Pro 644 

1935 ACT GGT ACA ACT CCG TCA GTC TTT TCG TGG TAC AAG ACA AAG TTT GAT GCC CCA GGC GGG ACA GAT CCA 2003 
645 Thr Gly Thr Thr Pro Ser Val Phe Ser Trp Tyr Lys Thr Lys Phe Asp Ala Pro Gly Gly Thr Asp Pro 667 

2004 GTT GCT CTT GAT TTT AGT AGC ATG GGA AAA GGT CAG GCA TGG GTT AAT GGC CAC CAT GTA GGA AGA TAT 2072 
668 Val Ala Leu Asp Phe Ser Ser Met Gly Lys Gly Gln Ala Trp Val Asn Gly His His Val Gly Arg Tyr 690 

2073 TGG ACT TTG GTT GCA CCA AAT AAT GGA TGT GGA AGA ACT TGT GAT TAT CGT GGT GCT TAC CAC TCT GAT 2141 
691 Trp Thr Leu Val Ala Pro Asn Asn Gly Cys Gly Arg Thr Cys Asp Tyr Arg Gly Ala Tyr His Ser Asp 713 

2142 AAA TGT AGG ACA AAC TGT GGA GAG ATT ACT CAG GCC TGG TAC CAT ATA CCT AGA TCA TGG CTA AAG ACA 2210 
714 Lys Cys Arg Thr Asn Cys Gly Glu Ile Thr Gln Ala Trp Tyr His Ile Pro Arg Ser Trp Leu Lys Thr 736 

2211 TTA AAT AAT GTA CTA GTT ATC TTT GAA GAA ACA GAT AAA ACT CCG TTT GAT ATT TCC ATT TCT ACG CGT 2279 
737 Leu Asn Asn Val Leu Val Ile Phe Glu Glu Thr Asp Lys Thr Pro Phe Asp Ile Ser Ile Ser Thr Arg 759 

2280 TCT ACT GAA ACC ATT TGT GCT CAA GTA TCG GAA AAG CAC TAT CCA CCT CTA CAT AAG TGG TCT CAT TCG 2348 
760 Ser Thr Glu Thr Ile Cys Ala Gln Val Ser Glu Lys His Tyr Pro Pro Leu His Lys Trp Ser His Ser 782 

2349 GAG TTT GAC AGA AAG TTG TCT CTG ATG GAT AAA ACA CCA GAA ATG CAC TTG CAG TGT GAC GAA GGA CAT 2417 
783 Glu Phe Asp Arg Lys Leu Ser Leu Met Asp Lys Thr Pro Glu Met His Leu Gln Cys Asp Glu Gly His 805 

2418 ACA ATC TCT TCT ATT GAA TTT GCA AGC TAT GGA AGT CCG AAT GGC AGC TGT CAA AAG TTC TCA CAA GGA 2486 
806 Thr Ile Ser Ser Ile Glu Phe Ala Ser Tyr Gly Ser Pro Asn Gly Ser Cys Gln Lys Phe Ser Gln Gly 828 

2487 AAA TGC CAT GCT GCA AAT TCC TTG TCT GTT GTA TCT CAG GCT TGT ATA GGA AGA ACT AGT TGC AGC ATT 2555 
829 Lys Cys His Ala Ala Asn Ser Leu Ser Val Val Ser Gln Ala Cys Ile Gly Arg Thr Ser Cys Ser Ile 851 

2556 GGC ATT TCC AAT GGT GTA TTT GGA GAT CCA TGT CGA CAC GTT GTG AAG AGT TTG GCT GTT CAA GCA AAA 2624 
852 Gly Ile Ser Asn Gly Val Phe Gly Asp Pro Cys Arg His Val Val Lys Ser Leu Ala Val Gln Ala Lys 874 

2625 TGC TCA CCA CCA CCA GAC CTC AGC ACT TCA GCT TCC TCG TGA GGAGACTCTGGTAACACGTTAACCT 2702 
875 Cys Ser Pro Pro Pro Asp Leu Ser Thr Ser Ala Ser Ser *** 888 

2703 ACGATCCCTTAAAGTCCACTCGTTC 2794 

2795 ATGGACGATTTTACTTGTAAATATTTGGT^ 2886 

2887 AGTAACAAATGAAAATAGAAAACTCCTGT^ 2978 

2979 AAAAAA 2984 

Gene name: TBG3; accession number AF154421; Sequence ID number 3 

1 AGAGTTCATTAT[illegible]TTGCA[illegible]GAAA 30 

31 AAGAGGAAAAAAATAAAGTTAAAGGGGGGGGAAAAAGTTT^ 121 

122 ATG GGT TGT ACG CTT ATA CTA ATG TTG AAT GTG TTG TTG GTG TTG TTG GGT TCA TGG GTT TTT TCT GGA 190 
1 Met Gly Cys Thr Leu Ile Leu Met Leu Asn Val Leu Leu Val Leu Leu Gly Ser Trp Val Phe Ser Gly 23 

191 ACA GCT TCT GTT TCA TAT GAC CAT AGG GCT ATT ATT GTA AAT GGA CAA AGA AGA ATA CTT ATT TCT GGT 259 
24 Thr Ala Ser Val Ser Tyr Asp His Arg Ala Ile Ile Val Asn Gly Gln Arg Arg Ile Leu Ile Ser Gly 46 

260 TCT GTT CAT TAT CCA AGA AGC ACT CCT GAG ATG TGG CCA GGT ATT ATT CAA AAG GCT AAA GAA GGA GGT 328 
47 Ser Val His Tyr Pro Arg Ser Thr Pro Glu Met Trp Pro Gly Ile Ile Gln Lys Ala Lys Glu Gly Gly 69 

329 GTG GAT GTG ATT CAG ACT TAT GTT TTC TGG AAT GGA CAT GAG CCT CAA CAA GGG AAA TAT TAT TTT GAA 397 
70 Val Asp Val Ile Gln Thr Tyr Val Phe Trp Asn Gly His Glu Pro Gln Gln Gly Lys Tyr Tyr Phe Glu 92 

398 GGG AGA TAT GAT TTA GTG AAG TTT ATT AAG CTG GTG CAC CAA GCA GGA CTT TAT GTC CAT CTT AGA GTT 466 
93 Gly Arg Tyr Asp Leu Val Lys Phe Ile Lys Leu Val His Gln Ala Gly Leu Tyr Val His Leu Arg Val 115 

467 GGA CCT TAT GCT TGT GCT GAA TGG AAT TTT GGG GGC TTT CCT GTT TGG CTG AAA TAT GTT CCA GGT ATC 535 
116 Gly Pro Tyr Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly Ile 138 

536 AGT TTC AGA ACA GAT AAT GGA CCT TTC AAG GCT GCA ATG CAA AAA TTT ACT GCC AAG ATT GTC AAT ATG 604 
139 Ser Phe Arg Thr Asp Asn Gly Pro Phe Lys Ala Ala Met Gln Lys Phe Thr Ala Lys Ile Val Asn Met 161 

605 ATG AAA GCG GAA CGT TTG TAT GAA ACT CAA GGG GGG CCA ATA ATT TTA TCT CAG ATT GAG AAT GAA TAT 673 
162 Met Lys Ala Glu Arg Leu Tyr Glu Thr Gln Gly Gly Pro Ile Ile Leu Ser Gln Ile Glu Asn Glu Tyr 184 

674 GGA CCC ATG GAA TGG GAA CTG GGA GCA CCA GGT AAA TCT TAC GCA CAG TGG GCC GCC AAA ATG GCT GTG 742 
185 Gly Pro Met Glu Trp Glu Leu Gly Ala Pro Gly Lys Ser Tyr Ala Gln Trp Ala Ala Lys Met Ala Val 207 

743 GGT CTT GAC ACT GGT GTC CCA TGG GTT ATG TGC AAG CAA GAC GAT GCC CCT GAT CCT ATT ATA AAT GCT 811 
208 Gly Leu Asp Thr Gly Val Pro Trp Val Met Cys Lys Gln Asp Asp Ala Pro Asp Pro Ile Ile Asn Ala 230 

812 TGC AAT GGC TTC TAC TGT GAC TAC TTT TCT CCA AAC AAG GCT TAT AAA CCA AAG ATA TGG ACT GAA GCC 880 
231 Cys Asn Gly Phe Tyr Cys Asp Tyr Phe Ser Pro Asn Lys Ala Tyr Lys Pro Lys Ile Trp Thr Glu Ala 253 

881 TGG ACT GCA TGG TTT ACT GGT TTT GGA AAT CCA GTT CCT TAC CGT CCT GCT GAG GAC TTG GCA TTT TCT 949 
254 Trp Thr Ala Trp Phe Thr Gly Phe Gly Asn Pro Val Pro Tyr Arg Pro Ala Glu Asp Leu Ala Phe Ser 276 

950 GTT GCA AAA TTT ATA CAG AAG GGA GGT TCC TTC ATC AAT TAT TAC ATG TAT CAT GGA GGA ACA AAC TTT 1018 
277 Val Ala Lys Phe Ile Gln Lys Gly Gly Ser Phe Ile Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe 299 

1019 GGA CGG ACT GCT GGT GGT CCA TTT ATT GCT ACT AGT TAT GAC TAT GAT GCA CCA CTT GAT GAA TAT GGA 1087 
300 Gly Arg Thr Ala Gly Gly Pro Phe Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr Gly 322 

1088 TTA TTG CGA CAA CCA AAA TGG GGT CAC CTG AAA GAT CTG CAT AGA GCA ATA AAG CTT TGT GAA CCA GCT 1156 
323 Leu Leu Arg Gln Pro Lys Trp Gly His Leu Lys Asp Leu His Arg Ala Ile Lys Leu Cys Glu Pro Ala 345 

1157 TTA GTC TCT GGA GAT CCA GCT GTG ACA GCA CTT GGA CAC CAG CAG GAG GCC CAT GTT TTT AGG TCG AAG 1225 
346 Leu Val Ser Gly Asp Pro Ala Val Thr Ala Leu Gly His Gln Gln Glu Ala His Val Phe Arg Ser Lys 368 

1226 GCT GGC TCT TGT GCT GCA TTC CTT GCT AAC TAC GAC CAA CAC TCT TTT GCT ACT GTG TCA TTT GCA AAC 1294 
369 Ala Gly Ser Cys Ala Ala Phe Leu Ala Asn Tyr Asp Gln His Ser Phe Ala Thr Val Ser Phe Ala Asn 391 

1295 AGG CAT TAC AAC TTG CCA CCA TGG TCA ATC AGC ATT CTT CCC GAC TGC AAG AAC ACT GTA TTT AAT ACA 1363 
392 Arg His Tyr Asn Leu Pro Pro Trp Ser Ile Ser Ile Leu Pro Asp Cys Lys Asn Thr Val Phe Asn Thr 414 

1364 GCA CGG ATC GGT GCT CAA AGT GCT CAG ATG AAG ATG ACT CCA GTC AGC AGA GGA TTG CCC TGG CAG TCA 1432 
415 Ala Arg Ile Gly Ala Gln Ser Ala Gln Met Lys Met Thr Pro Val Ser Arg Gly Leu Pro Trp Gln Ser 437 

1433 TTC AAT GAA GAG ACA TCA TCT TAT GAA GAC AGT AGT TTT ACA GTT GTT GGG CTA TTG GAA CAG ATA AAT 1501 
438 Phe Asn Glu Glu Thr Ser Ser Tyr Glu Asp Ser Ser Phe Thr Val Val Gly Leu Leu Glu Gln Ile Asn 460 
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1502 ACA ACA AGA GAC GTG TCT GAT TAT TTG TOG TAT TCA ACA GAT GTC AAG ATT GAT TCA AGA GAA AAG TTT 1570 

461 Thr Thr Arg Asp Val Ser Asp Tyr Leu Trp Tyr Ser Thr Asp Val Lys lie Asp Ser Arg Glu Lys Phe 483 

1571 TTG AGA GGC GGA AAA TGG CCT TGG CTT ACG ATC ATG TCA GOT GGG CAT GCA TTG CAT GTT TTT GTG AAT 1639 

464 Leu Arg Gly Gly Lys Trp Pro Trp Leu Thr lie Met Ser Ala Gly His Ala Leu His Val Phe Val Asn 506 

1640 GGT CAA TTA GCA GGA ACT GCA TAT GGA ACT TTA GAA AAA CCG AAA CTA ACT TTC AGT AAA <2CC GTA AAT 1706 

507 Gly Gin Leu Ala Gly Thr Ala Tyr Gly Ser Leu Glu Lys Pro Lys Leu Thr Phe Ser Lys Ala Val Asn 529 

1709 CTG AGA GCA GGT GTT AAC AAG ATT TCT CTA CTG AGC ATT OCT GTT GGC CTT CCG AAT ATC GGC CCA CAT 1777 

530 Leu Arg Ala Gly Val Asn Lys He Ser Leu Leu Ser He Ala Val Gly Leu Pro Asn He Gly Pro His 552 

1778 TTT GAG ACA TGG AAT GCT GGT GTT CTT GGG CCA GTC TCA CTA ACT GGT CTT GAC GAG GGG AAA AGA GAT 1B46 

553 Phe Glu Thr Trp Asn Ala Gly Val Leu Gly Pro Val Ser Leu Thr Gly Leu Asp Glu Gly Lys Arg Asp 575 

1847 TTA ACA TGG CAG AAA TGG TCT TAC AAG GTT GGT CTA AAA GGA GAA GCC TTG AGC CTC CAT TCA CTC AGT 1915 

576 Leu Thr Trp Gin Lys Trp Ser Tyr Lys Val Gly Leu Lys Gly Glu Ala Leu ser Leu His Ser Leu Ser 598 

1916 GGT AGC TCG TCA GTT GAG TGG GTC GAG GGT TCT TTA GTG GCT CAG AGA CAG CCA CTC ACA TGG TAC AAG 1984 

599 Gly Ser Ser Ser Val Glu Trp Val Glu Gly Ser Leu Val Ala Gin Arg Gin Pro Leu Thr Trp Tyr Lys 621 

1985 AGC ACT TTT AAT GCT CCA GCT GGA AAT GAT CCT TTG GCT TTA GAC TTG AAT ACC ATG GGC AAA GGA CAA 2053 

622 Ser Thr Phe Asn Ala Pro Ala Gly Asn Asp Pro Leu Ala Leu Asp Leu Asn Thr Met Gly Lys Gly Gin 644 

2054 GTG TGG ATA AAT GGT CAA AGC CTC GGA CGC TAT TGG CCT GGA TAT AAA GCA TCT GGT AAC TGC GGT GCC 2122 

645 Val Trp He Asn Gly Gin Ser Leu Gly Arg Tyr Trp Pro Gly Tyr Lys Ala Ser Gly Asn Cys Gly Ala 667 

2123 TGT AAC TAT GCA GGC TGG TTT AAT GAG AAA AAA TGC CTA AGT AAC TGT GGA GAG GCT TCA CAA CGA TGG 2191 

668 Cys Asn Tyr Ala Gly Trp Phe Asn Glu Lys Lys Cys Leu Ser Asn Cys Gly Glu Ala Ser Gin Arg Trp 690 

2192 TAT CAT GTT CCC CGT TCT TGG CTG TAT CCT ACT GGA AAT TTG TTA GTT CTA TTT GAG GAA TGG GGA GGA 2260 

691 Tyr His Val Pro Arg Ser Trp Leu Tyr Pro Thr Gly Asn Leu Leu Val Leu Phe Glu Glu Trp Gly Gly 713 

2261 GAG CCT CAT GGA ATC TCT TTG GTA AAA AGA GAA GTT GCA AGT GTT TGT GCA GAT ATA AAC GAA TGG CAA 2329 

714 Glu Pro His Gly He Ser Leu Val Lys Arg Glu Val Ala Ser Val Cys Ala Asp He Asn Glu Trp Gin 736 

2330 CCA CAG TTG GTG AAT TGG CAA ATG CAA GCA TCT GGT AAA GTT GAC AAA CCA CTG AGA CCT AAA GCT CAC 2398 

737 Pro Gin Leu Val Asn Trp Gin Met Gin Ala Ser Gly Lys Val Asp Lys Pro Leu Arg Pro Lys Ala His 759 

2399 CTC TCG TGT GCT TCT GGT CAG AAG ATT ACT TCA ATC AAA TTT GCA AGC TTT GGA ACA CCA CAA GGG GTC 2467 

760 Leu Ser Cys Ala Ser Gly Gin Lys He Thr Ser He Lys Phe Ala Ser Phe Gly Thr Pro Gin Gly Val 782 

2468 TGC GGA AGC TTC CGT GAA GGA AGC TGC CAC GCC TTC CAC TCA TAT GAT GCT TTT GAA AGG TAT TGC ATC 2536 

783 Cys Gly Ser Phe Arg Glu Gly Ser Cys His Ala Phe His Ser Tyr Asp Ala Phe Glu Arg Tyr Cys Ile 805

2537 GGG CAA AAC TCG TGC TCA GTA CCT GTA ACA CCA GAG ATC TTT GGA GGT GAT CCA TGT CCA CAT GTT ATG 2605 

806 Gly Gin Asn Ser Cys Ser Val Pro Val Thr Pro Glu He Phe Gly Gly Asp Pro Cys Pro His Val Met 828 

2606 AAG AAA CTC TCA GTT GAG GTT ATT TGC AGT TGA TGACACTGAGGAGAAACAAATAAAAGTGGTT^ 2686 

829 Lys Lys Leu Ser Val Glu Val He Cys Ser *** 840 

2687 CATATCAAAAAGTTGGCTTTGATGGAGGTQA 2778 

2779 GATTCTX3TACATATATGTTGGTT 2870 

2871 ATTGTGCTAGTGGGAGGTAGTAGGTACCGATCGATCTATC^ 2962 

2963 ACAATGAGACTGATTCTTGATTAGTCCATGTCT 3054 

3055 AAAAAAAAAAAAAAA 3069 
Gene/clone name: TBG4/pZBG2-1-4/pTomβgal 4; accession number AF0203[illegible]; Sequence ID number 4



1 AAAAAAAqTTT<^ Vmvr TTCTAAAAT^^ 63 

64 ATG CTA AGG ACT AAT GTG TTG TTG TTA TTA GTT ATT TGT TTA TTG GAT TTT TTT TCT TCA GTG AAA GCT 132 

1 Met Leu Arg Thr Asn Val Leu Leu Leu Leu Val Ile Cys Leu Leu Asp Phe Phe Ser Ser Val Lys Ala 23

133 ACT GTT TCT TAT GAT GAC AGA GCT ATA ATC ATA AAT GGG AAA AGA AAA ATT CTT ATT TCT GGT TCA ATT 201

24 Ser Val Ser Tyr Asp Asp Arg Ala Ile Ile Ile Asn Gly Lys Arg Lys Ile Leu Ile Ser Gly Ser Ile 46

202 CAT TAT CCA AGA AGC ACT CCA CAG ATG TGG CCT GAT CTT ATA CAA AAG GCT AAA GAT GGA GGC TTA GAT 270

47 His Tyr Pro Arg Ser Thr Pro Gln Met Trp Pro Asp Leu Ile Gln Lys Ala Lys Asp Gly Gly Leu Asp 69

271 GTT ATT GAA ACT TAT GTT TTC TGG AAT GGA CAT GAG CCT TCT CCT GGA AAA TAT AAT TTT GAA GGA AGA 339 

70 Val He Glu Thr Tyr Val Phe Trp Asn Gly His Glu Pro Ser Pro Gly Lys Tyr Asn Phe Glu Gly Arg 92 

340 TAT GAT CTT GTT AGA TTC ATC AAA ATG GTA CAA AGA GCA GGA CTT TAT GTC AAT TTA CGT ATT GGC CCT 408 

93 Tyr Asp Leu Val Arg Phe He Lys Met Val Gin Arg Ala Gly Leu Tyr Val Asn Leu Arg He Gly Pro 115 

409 TAC GTC TGT GCT GAA TGG AAC TTT GGG GGA TTC CCT GTT TGG CTA AAA TAT GTG CCT GGT ATG GAA TTT 477 

116 Tyr Val Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly Met Glu Phe 138

478 AGA ACA AAC AAT CAG CCT TTT AAG GTG GCT ATG CAA GGA TTT GTT CAG AAA ATA GTC AAC ATG ATG AAG 546 

139 Arg Thr Asn Asn Gin Pro Phe Lys Val Ala Met Gin Gly Phe Val Gin Lys He Val Asn Met Met Lys 161 

547 TCA GAA AAT TTG TTT GAA TCT CAA GGA GGA CCA ATA ATT ATG GCC CAG ATA GAA AAT GAG TAT GGA CCA 615 

162 Ser Glu Asn Leu Phe Glu Ser Gin Gly Gly Pro He He Met Ala Gin He Glu Asn Glu Tyr Gly Pro 184 

616 GTA GAA TGG GAA ATT GGT GCT CCT GGT AAA GCT TAT ACA AAA TGG GCA GCT CAA ATG GCT GTA GGT TTG 684 

185 Val Glu Trp Glu He Gly Ala Pro Gly Lys Ala Tyr Thr Lys Trp Ala Ala Gin Met Ala Val Gly Leu 207 

685 AAA ACT GGT GTC CCA TGG ATC ATG TGT AAG CAA GAG GAT GCT CCT GAT CCT GTG ATT GAT ACT TGT AAT 753 

208 Lys Thr Gly Val Pro Trp He Met Cys Lys Gin Glu Asp Ala Pro Asp Pro Val He Asp Thr Cys Asn 230 

754 GGC TTC TAC TGC GAA GGG TTC CGT CCT AAT AAG CCT TAC AAA CCT AAA ATG TGG ACA GAA GTA TGG ACT 822 

231 Gly Phe Tyr Cys Glu Gly Phe Arg Pro Asn Lys Pro Tyr Lys Pro Lys Met Trp Thr Glu Val Trp Thr 253 

823 GGC TGG TAT ACG AAA TTC GGT GGT CCA ATT CCT CAA AGA CCA GCC GAA GAC ATT GCA TTT TCA GTT GCC 891 

254 Gly Trp Tyr Thr Lys Phe Gly Gly Pro lie Pro Gin Arg Pro Ala Glu Asp He Ala Phe Ser Val Ala 276 

892 AGG TTT GTT CAG AAC AAT GGT TCA TTC TTC AAT TAC TAC ATG TAT CAT GGA GGA ACA AAT TTT GGC CGG 960 

277 Arg Phe Val Gin Asn Asn Gly Ser Phe Phe Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg 299 

961 ACA TCA TCA GGG CTT TTC ATT GCA ACT AGC TAC GAT TAT GAT GCT CCT CTC GAT GAA TAT GGG TTG CTG 1029 

300 Thr Ser Ser Gly Leu Phe He Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr Gly Leu Leu 322 

1030 AAT GAA CCA AAG TAT GGG CAC TTG AGA GAC TTA CAT AAA GCT ATC AAG CTA TCT GAA CCG GCT TTA GTT 1098 

323 Asn Glu Pro Lys Tyr Gly His Leu Arg Asp Leu His Lys Ala He Lys Leu Ser Glu Pro Ala Leu Val 345 

1099 TCA TCA TAT GCT GCG GTG ACT AGT CTT GGA AGT AAT CAA GAG GCT CAT GTT TAT AGA TCA AAA TCT GGA 1167

346 Ser Ser Tyr Ala Ala Val Thr Ser Leu Gly Ser Asn Gin Glu Ala His Val Tyr Arg Ser Lys Ser Gly 368 

1168 GCT TGT GCT GCT TTT TTA TCC AAC TAT GAC TCT AGA TAT TCA GTA AAA GTC ACC TTT CAG AAT AGG CCA 1236 

369 Ala Cys Ala Ala Phe Leu Ser Asn Tyr Asp Ser Arg Tyr Ser Val Lys Val Thr Phe Gin Asn Arg Pro 391 

1237 TAC AAT CTG CCT CCA TGG TCC ATC AGC ATT CTT CCC GAC TGC AAA ACT GCC GTT TAC AAC ACT GCA CAG 1305 

392 Tyr Asn Leu Pro Pro Trp Ser lie Ser He Leu Pro Asp Cys Lys Thr Ala Val Tyr Asn Thr Ala Gin 414 

1306 GTT AAC TCT CAA AGC TCG AGC ATA AAG ATG ACG CCT GCA GGT GGT GGA TTG TCT TGG CAG TCA TAC AAT 1374 

415 Val Asn Ser Gin Ser Ser Ser He Lys Met Thr Pro Ala Gly Gly Gly Leu Ser Trp Gin Ser Tyr Asn 437 

1375 GAA GAA ACG CCT ACT GCT GAT GAC AGC GAT ACA CTT ACA GCT AAC GGA CTA TGG GAA CAG AAA AAC GTC 1443 

438 Glu Glu Thr Pro Thr Ala Asp Asp Ser Asp Thr Leu Thr Ala Asn Gly Leu Trp Glu Gin Lys Asn Val 460 
Gene/clone name: TBG4/pZBG2-1-4/pTomβgal 4; accession number AF0203[illegible]; Sequence ID number 4 cont.

1444 ACA AGA GAT TCA TCA GAC TAT CTG TGG TAC ATG ACA AAT GTA AAT ATA GCA TCT AAT GAA GGA TTT CTA 1512 

461 Thr Arg Asp Ser Ser Asp Tyr Leu Trp Tyr Met Thr Asn Val Asn Ile Ala Ser Asn Glu Gly Phe Leu 483

1513 AAG AAC GGA AAG GAT CCT TAT CTC ACT GTT ATG TCC GCT GGT CAT GTC TTG CAT GTT TTC GTC AAT GGA 1581

484 Lys Asn Gly Lys Asp Pro Tyr Leu Thr Val Met Ser Ala Gly His Val Leu His Val Phe Val Asn Gly 506 

1582 AAA CTA TCA GGA ACT GTT TAT GGT ACA TTG GAT AAT CCA AAA CTT ACA TAC ACT GGC AAC GTG AAG TTA 1650

507 Lys Leu Ser Gly Thr Val Tyr Gly Thr Leu Asp Asn Pro Lys Leu Thr Tyr Ser Gly Asn Val Lys Leu 529 

1651 AGA GCT GGT ATT AAC AAG ATT TCT CTG CTC AGT GTT TCC GTT GGT CTC CCG AAC GTT GGC GTG CAT TAT 1719 

530 Arg Ala Gly lie Asn Lys lie Ser Leu Leu Ser Val Ser Val Gly Leu Pro Asn Val Gly Val His Tyr 552 

1720 GAT ACA TGG AAT GCA GGA GTT CTA GGT CCA GTC ACG TTG AGC GGT CTC AAT GAA GGG TCA AGA AAC TTG 1788 

553 Asp Thr Trp Asn Ala Gly Val Leu Gly Pro Val Thr Leu Ser Gly Leu Asn Glu Gly Ser Arg Asn Leu 575 



1789 GCG AAA CAG AAA TGG TCT TAC AAG GTT GGT CTG AAA GGC GAA TCG TTA AGT CTT CAC TCC TTA AGT GGG 1857

576 Ala Lys Gln Lys Trp Ser Tyr Lys Val Gly Leu Lys Gly Glu Ser Leu Ser Leu His Ser Leu Ser Gly 598



1858 AGT TCT TCT GTT GAA TGG GTT CGA GGT TCA CTA ATG GCT CAA AAG CAG CCC CTG ACT TGG TAC AAG GCT 1926

599 Ser Ser Ser Val Glu Trp Val Arg Gly Ser Leu Met Ala Gln Lys Gln Pro Leu Thr Trp Tyr Lys Ala 621

1927 ACA TTT AAC GCG CCT GGA GGA AAT GAT CCA CTA GCT TTA GAC ATG GCA AGT ATG GGA AAA GGT CAG ATA 1995

622 Thr Phe Asn Ala Pro Gly Gly Asn Asp Pro Leu Ala Leu Asp Met Ala Ser Met Gly Lys Gly Gln Ile 644

1996 TGG ATA AAT GGT GAA GGC GTA GGT CGC CAT TGG CCT GGA TAC ATA GCA CAA GGC GAC TGC AGC AAA TGC 2064

645 Trp Ile Asn Gly Glu Gly Val Gly Arg His Trp Pro Gly Tyr Ile Ala Gln Gly Asp Cys Ser Lys Cys 667

2065 AGT TAT GCT GGA ACG TTC AAC GAG AAG AAG TGC CAG ACT AAC TGC GGA CAA CCT TCT CAG AGA TGG TAC 2133

668 Ser Tyr Ala Gly Thr Phe Asn Glu Lys Lys Cys Gln Thr Asn Cys Gly Gln Pro Ser Gln Arg Trp Tyr 690

2134 CAT GTT CCA CGA TCG TGG CTG AAA CCA AGT GGA AAC TTG TTA GTA GTA TTC GAA GAA TGG GGA GGT AAT 2202

691 His Val Pro Arg Ser Trp Leu Lys Pro Ser Gly Asn Leu Leu Val Val Phe Glu Glu Trp Gly Gly Asn 713

2203 CCA ACA GGA ATT TCT CTA GTC AGG AGA TCA AGA TAA AGAACTCGAAAAGTAAAACTTGTTC 2282

714 Pro Thr Gly Ile Ser Leu Val Arg Arg Ser Arg *** 725



2283 TTCGCGCCGAAAAATACATACACGAAGCTAA 2374 
2375 TGATTAAAAGGAGTATATAAATTTACAGAGAAT^ 2466 
2467 GGATTATGAGATTGAAGAAGATTGTACAGCTTCCAA 2554 
Gene/clone name: TBG5/R*R2-l/bl; accession number AF154423; Sequence ID number 5



1 ATC CAG ACT TAC GTT TTC TGG AAC CTT CAT GAA CCT GTT CGA AAT CAG TAT GAT TTT GAA GGA AGG AAA 69

1 Ile Gln Thr Tyr Val Phe Trp Asn Leu His Glu Pro Val Arg Asn Gln Tyr Asp Phe Glu Gly Arg Lys 23

70 GAT TTG ATT AAT TTT GTG AAG TTG GTG GAG AGA GCT GGC TTA TTT GTT CAT ATA AGG ATT GGG CCT TAT 138

24 Asp Leu Ile Asn Phe Val Lys Leu Val Glu Arg Ala Gly Leu Phe Val His Ile Arg Ile Gly Pro Tyr 46

139 GTT TGT GCA GAA TGG AAC TAT GGT GGG TTT CCT CTT TGG TTG CAT TTC ATT CCT GGA ATT GAA TTT CGA 207

47 Val Cys Ala Glu Trp Asn Tyr Gly Gly Phe Pro Leu Trp Leu His Phe Ile Pro Gly Ile Glu Phe Arg 69

208 ACC GAC AAT GAA CCG TTC AAG GCA GAA ATG AAG CGA TTC ACA GCT AAA ATT GTT GAC ATG ATC AAG CAA 276

70 Thr Asp Asn Glu Pro Phe Lys Ala Glu Met Lys Arg Phe Thr Ala Lys Ile Val Asp Met Ile Lys Gln 92

277 GAA AAT CTA TAT GCA TCC CAG GGT GGG CCG GTT ATC TTG TCT CAG ATA GAA AAT GAG TAT GGC AAT GGT 345

93 Glu Asn Leu Tyr Ala Ser Gln Gly Gly Pro Val Ile Leu Ser Gln Ile Glu Asn Glu Tyr Gly Asn Gly 115

346 GAT ATT GAG TCT CGT TAT GGT CCT CGT GCC AAA CCT TAC GTG AAC TGG GCA GCA TCA ATG GCT ACG TCT 414

116 Asp Ile Glu Ser Arg Tyr Gly Pro Arg Ala Lys Pro Tyr Val Asn Trp Ala Ala Ser Met Ala Thr Ser 138

415 TTA AAT ACG GGA GTG CCA TGG GTT ATG TGT CAG CAA CCA GAT GCC CCT CCT TCC GTT ATT AAC ACT TGC 483

139 Leu Asn Thr Gly Val Pro Trp Val Met Cys Gln Glu Pro Asp Ala Pro Pro Ser Val Ile Asn Thr Cys 161

484 AAT GGA TTT TAT TGT GAC CAA TTC AAG CAA AAT TCC GAT AAA ACA CCC AAG ATG TGG ACT GAG AAT TGG 552

162 Asn Gly Phe Tyr Cys Asp Gln Phe Lys Gln Asn Ser Asp Lys Thr Pro Lys Met Trp Thr Glu Asn Trp 184

553 ACC GGA TGG TTT CTG TCG TTT GGT GGT CCT GTC CCT TAC AGA CCA GTG GAA GAC ATC GCT TTC GCT GTG 621

185 Thr Gly Trp Phe Leu Ser Phe Gly Gly Pro Val Pro Tyr Arg Pro Val Glu Asp Ile Ala Phe Ala Val 207

622 GCT CGA TTT TTC CAG CGA GGC GGA ACT TTC CAG AAC TAT TAC ATG TAC CAC GGG GGA ACT AAC TTT GGG 690

208 Ala Arg Phe Phe Gln Arg Gly Gly Thr Phe Gln Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly 230

691 AGA ACC ACT GGT GGA CCG TTT ATT GCA ACT AGC TAT GAC TAT GAT GCC CCT CTC GAC GAA TAC GG 755

231 Arg Thr Ser Gly Gly Pro Phe Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 252
Gene/clone name: TBG6/RT4Bl2-6/bl; accession number AF154424; Sequence ID number 6



1 ATC CAG ACA TAT GTT TTT TGG AAT GTT CAT GAG CCT TCT CCT GGC AAT TAC AAT TTT GAA GGA AGA TAT 69 

1 lie Gin Thr Tyr Val Phe Trp Asn Val His Glu Pro Ser Pro Gly Asn Tyr Asn Phe Glu Gly Arg Tyr 23 

70 GAC CTG GTG AGG TTT GTA AAA ACG ATT CAG AAA GCA GGG CTG TAT GCT CAT CTT CGA ATT GGC CCT TAC 138 

24 Asp Leu Val Arg Phe Val Lys Thr Ile Gln Lys Ala Gly Leu Tyr Ala His Leu Arg Ile Gly Pro Tyr 46

139 GTT TGT GCA GAG TGG AAT TTT GGA GGG TTT CCA GTA TGG CTG AAG TAT GTA CCT GGC ATT AGC TTC AGA 207

47 Val Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly Ile Ser Phe Arg 69

208 GCT GAT AAT GAA CCT TTC AAG AAC GCA ATG AAA GGG TAT GCT GAG AAA ATT GTT AAC TTG ATG AAG ATC 276 

70 Ala Asp Asn Glu Pro Phe Lys Asn Ala Met Lys Gly Tyr Ala Glu Lys He Val Asn Leu Met Lys He 92 

277 ATA ATC TTT TCG ACT CTC AGG GTG GTC CAA TCA TAC TCT GAC AGA TTG AGA ATG AGT ATG GGC CTC AAG 345 

93 He He Phe Ser Ser Leu Arg Val Val Gin Ser Tyr Ser His Arg Leu Arg Met Ser Met Gly Leu Lys 115 

346 CCA AGG TAC TTG GAG CAC CGG GAC ATC AGT ATT CAA CAT GGG CTG CAA ATA TGG CAG TTG GAT TTG AAC 414

116 Pro Arg Tyr Leu Glu His Arg Asp Ile Ser Ile Gln His Gly Leu Gln Ile Trp Gln Leu Asp Leu Asn 138

415 ACA GGC GTC CCA TGG GTG ATG TGC AAG GAA GAA GAT GCA CCA GAT CCT GTG ATC AAC ACA TGC AAT GGT 483 

139 Thr Gly Val Pro Trp Val Met Cys Lys Glu Glu Asp Ala Pro Asp Pro Val He Asn Thr Cys Asn Gly 161 

484 TTC TAC TGT GAT AAT TTC TTC CCA AAC AAA CCA TAC AAA CCT GCA ATT TGG ACT GAA GCT TGG AGT GGA 552 

162 Phe Tyr Cys Asp Asn Phe Phe Pro Asn Lys Pro Tyr Lys Pro Ala He Trp Thr Glu Ala Trp Ser Gly 184 

553 TGG TTC TCG GAA TTT GGC GGT CCC CTT CAT CAG AGA CCA GTT CAG GAT TTG GCA TTT GCT GTT GCC CAA 621

185 Trp Phe Ser Glu Phe Gly Gly Pro Leu His Gln Arg Pro Val Gln Asp Leu Ala Phe Ala Val Ala Gln 207

622 TTT ATA CAA AGA GGA GGA TCT TTT GTT AAC TAT TAC ATG TAC CAT GGG GGC ACG AAC TTT GGA CGC ACT 690 

208 Phe He Gin Arg Gly Gly Ser Phe Val Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr 230 

691 GCG GGT GGG CCA TTC ATC ACT ACC AGC TAT GAT TAT GAT GCC CCC CTC GAC GAG TAT GG 749 

231 Ala Gly Gly Pro Phe He Thr Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 250 
Gene/clone name: TBG7/pZBG2-1-18; accession number AF154422; Sequence ID number 7

1 [illegible] 12

13 GTGAATAACACCGGTAAACQGCCAATGC^^ 103 

104 ATG AAC ACA ATG AGT TGT TTG TCC TCT AAT TTC AAG TTC GTT TTC CTT GCC TCG ACT GTG ATA TGG ATG 172 

1 Met Asn Thr Met Ser Cys Leu Ser Ser Asn Phe Lys Phe Val Phe Leu Ala Ser Thr Val Ile Trp Met 23

173 ACG GTA ATG TCG TCG TCG TTA GCA GCA GTA GAT GCT TCC AAT GTT ACT ACT ATT GGT ACT GAT AGT GTG 241

24 Thr Val Met Ser Ser Ser Leu Ala Ala Val Asp Ala Ser Asn Val Thr Thr Ile Gly Thr Asp Ser Val 46

242 ACT TAC GAT CGA CGC TCG TTG ATT ATT AAC GGC CAG AGG AAG CTG CTC ATC TCC GCT TCC ATT CAC TAT 310 

47 Thr Tyr Asp Arg Arg Ser Leu Ile Ile Asn Gly Gln Arg Lys Leu Leu Ile Ser Ala Ser Ile His Tyr 69

311 CCT CGC AGT GTC CCT GCC ATG TGG CCT GGT CTG GTT CGA TTG GCG AAG GAA GGA GGA GTG GAT GTT ATT 379

70 Pro Arg Ser Val Pro Ala Met Trp Pro Gly Leu Val Arg Leu Ala Lys Glu Gly Gly Val Asp Val Ile 92

380 GAA ACG TAT GTT TTC TGG AAC GGT CAC GAA CCT TCT CCG GGC AAT TAT TAC TTT GGA GGA AGG TTT GAT 448

93 Glu Thr Tyr Val Phe Trp Asn Gly His Glu Pro Ser Pro Gly Asn Tyr Tyr Phe Gly Gly Arg Phe Asp 115 

449 CTA GTC AAA TTT TGT AAG ATC ATT CAG CAG GCT GGA ATG TAT ATG ATT CTT CGG ATT GGA CCA TTT GTA 517 

116 Leu Val Lys Phe Cys Lys Ile Ile Gln Gln Ala Gly Met Tyr Met Ile Leu Arg Ile Gly Pro Phe Val 138

518 GCT GCA GAA TGG AAC TTT GGT GGA CTT CCT GTG TGG TTG CAT TAT GTG CCA GGT ACC ACC TTT CGG ACT 586 

139 Ala Ala Glu Trp Asn Phe Gly Gly Leu Pro Val Trp Leu His Tyr Val Pro Gly Thr Thr Phe Arg Thr 161

587 GAT AGT GAA CCA TTT AAG TAT CAC ATG CAG AAG TTC ATG ACA TAT ACA GTG AAC TTA ATG AAG AGA GAG 655 

162 Asp Ser Glu Pro Phe Lys Tyr His Met Gin Lys Phe Met Thr Tyr Thr Val Asn Leu Met Lys Arg Glu 184 

656 AGG CTT TTT GCA TCT CAA GGA GGT CCA ATC ATC TTG TCA CAG GTA GAA AAT GAG TAC GGC TAC TAT GAA 724 

185 Arg Leu Phe Ala Ser Gln Gly Gly Pro Ile Ile Leu Ser Gln Val Glu Asn Glu Tyr Gly Tyr Tyr Glu 207

725 AAT GCA TAT GGA GAA GGA GGG AAA AGG TAT GCC TTA TGG GCT GCT AAA ATG GCC CTT TCT CAA AAT ACT 793

208 Asn Ala Tyr Gly Glu Gly Gly Lys Arg Tyr Ala Leu Trp Ala Ala Lys Met Ala Leu Ser Gln Asn Thr 230



794 GGT GTA CCT TGG ATA ATG TGC CAG CAG TAT GAT GCT CCT GAT CCT GTG ATT GAC ACA TGC AAT TCA TTT 862 

231 Gly Val Pro Trp He Met Cys Gin Gin Tyr Asp Ala Pro Asp Pro Val He Asp Thr Cys Asn Ser Phe 253 

863 TAC TGC GAC CAA TTT AAA CCA ATC TCT CCA AAC AAG CCC AAA ATT TGG ACA GAG AAC TGG CCG GGA TGG 931 

254 Tyr Cys Asp Gin Phe Lys Pro He Ser Pro Asn Lys Pro Lys He Trp Thr Glu Asn Trp Pro Gly Trp 276 

932 TTC AAG ACA TTT GGG GCC AGA GAT CCT CAC AGG CCT GCA GAA GAT GTT GCT TAT TCC GTG GCT CGT TTT 1000 

277 Phe Lys Thr Phe Gly Ala Arg Asp Pro His Arg Pro Ala Glu Asp Val Ala Tyr Ser Val Ala Arg Phe 299 

1001 TTC CAA AAA GGA GGA AGC GTG CAG AAT TAT TAC ATG TAC CAT GGT GGG ACG AAC TTT GGC AGG ACA GCA 1069 

300 Phe Gin Lys Gly Gly Ser Val Gin Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr Ala 322 

1070 GGT GGC CCT TTC ATT ACC ACA AGT TAT GAC TAT GAT GCT CCA ATT GAC GAA TAT GGT TTA CCA AGG TTT 1138

323 Gly Gly Pro Phe Ile Thr Thr Ser Tyr Asp Tyr Asp Ala Pro Ile Asp Glu Tyr Gly Leu Pro Arg Phe 345

1139 CCA AAA TGG GGT CAC CTT AAA GAA CTT CAT AAA GTC ATA AAA TCG TGT GAG CAT GCT CTG CTG AAC AAT 1207 

346 Pro Lys Trp Gly His Leu Lys Glu Leu His Lys Val He Lys Ser Cys Glu His Ala Leu Leu Asn Asn 368 

1208 GAT CCA ACT CTT CTT TCA TTA GGT CCT CTA CAA GAG GCT GAT GTT TAT GAA GAT GCT TCA GGC GCT TGT 1276 

369 Asp Pro Thr Leu Leu Ser Leu Gly Pro Leu Gin Glu Ala Asp Val Tyr Glu Asp Ala Ser Gly Ala Cys 391 

1277 GCT GCC TTT CTC GCG AAT ATG GAT GAC AAA AAT GAC AAG GTG GTA CAG TTC CGA CAT GTA TCA TAC CAC 1345

392 Ala Ala Phe Leu Ala Asn Met Asp Asp Lys Asn Asp Lys Val Val Gin Phe Arg His Val Ser Tyr His 414 

1346 TTG CCA GCA TGG TCT GTT AGC ATT TTG CCA GAC TGC AAA AAT GTA GCG TTC AAC ACA GCA AAG GTT GGA 1414 

415 Leu Pro Ala Trp Ser Val Ser He Leu Pro Asp Cys Lys Asn Val Ala Phe Asn Thr Ala Lys Val Gly 437 

1415 TGT CAA ACT TCT ATT GTC AAT ATG GCA CCC ATA GAT TTG CAT CCC ACC GCA AGT TCA CCA AAG AGA GAC 1483 

438 Cys Gin Thr Ser He Val Asn Met Ala Pro He Asp Leu His Pro Thr Ala Ser Ser Pro Lys Arg Asp 460 
Gene/clone name: TBG7/pZBG2-1-18; accession number AF154422; Sequence ID number 7 cont.



1484 ATC AAG TCT CTT CAG TGG GAA GTC TTC AAG GAA ACA GCT GGA GTA TGG GGA GTT GCT GAT TTC ACT AAA 1552

461 Ile Lys Ser Leu Gln Trp Glu Val Phe Lys Glu Thr Ala Gly Val Trp Gly Val Ala Asp Phe Thr Lys 483



1553 AAC GGA TTT GTA GAT CAC ATT AAC ACC ACA AAA GAT GCT ACA GAC TAC CTC TGG TAC ACA ACA AGT ATT 1621 

484 Asn Gly Phe Val Asp His Ile Asn Thr Thr Lys Asp Ala Thr Asp Tyr Leu Trp Tyr Thr Thr Ser Ile 506

1622 TTT GTT CAT GCA GAG GAG GAT TTC CTA AGA AAC AGA GGC ACT GCA ATG CTT TTC GTT GAA TCA AAG GGT 1690 

507 Phe Val His Ala Glu Glu Asp Phe Leu Arg Asn Arg Gly Thr Ala Met Leu Phe Val Glu Ser Lys Gly 529

1691 CAT GCT ATG CAT GTC TTC ATC AAT AAA AAG CTT CAA GCC AGT GCA TCT GGA AAT GGC ACA GTG CCA CAG 1759 

530 His Ala Met His Val Phe Ile Asn Lys Lys Leu Gln Ala Ser Ala Ser Gly Asn Gly Thr Val Pro Gln 552

1760 TTC AAG TTT GGA ACT CCT ATT GCT CTA AAG GCA GGG AAG AAT GAA ATT TCC TTG TTA AGC ATG ACT GTG 1828 

553 Phe Lys Phe Gly Thr Pro Ile Ala Leu Lys Ala Gly Lys Asn Glu Ile Ser Leu Leu Ser Met Thr Val 575

1829 GGC CTA CAA ACA GCT GGA GCG TTT TAT GAA TGG ATT GGA GCT GGT CCA ACA AGT GTC AAA GTT GCA GGG 1897

576 Gly Leu Gln Thr Ala Gly Ala Phe Tyr Glu Trp Ile Gly Ala Gly Pro Thr Ser Val Lys Val Ala Gly 598

1898 TTC AAG ACT GGG ACT ATG GAC TTG ACT GCG TCT GCT TGG ACC TAT AAG ATT GGA TTG CAA GGA GAA CAT 1966 

599 Phe Lys Thr Gly Thr Met Asp Leu Thr Ala Ser Ala Trp Thr Tyr Lys He Gly Leu Gin Gly Glu His 621 

1967 TTG AGG ATA CAG AAG TCA TAT AAC TTG AAG AGT AAA ATT TGG GCA CCA ACT TCG CAG CCA CCA AAG CAA 2035

622 Leu Arg Ile Gln Lys Ser Tyr Asn Leu Lys Ser Lys Ile Trp Ala Pro Thr Ser Gln Pro Pro Lys Gln 644

2036 CAG CCC CTC ACA TGG TAT AAG GCA GTA GTA GAT GCG CCT CCT GGT AAT GAA CCT GTT GCA CTT GAT ATG 2104

645 Gln Pro Leu Thr Trp Tyr Lys Ala Val Val Asp Ala Pro Pro Gly Asn Glu Pro Val Ala Leu Asp Met 667

2105 ATT CAT ATG GGA AAA GGA ATG GCT TGG TTG AAT GGA CAA GAA ATT GGC AGA TAT TGG CCG AGG AGA ACT 2173

668 Ile His Met Gly Lys Gly Met Ala Trp Leu Asn Gly Gln Glu Ile Gly Arg Tyr Trp Pro Arg Arg Thr 690

2174 TCT AAA TAT GAG AAT TGT GTT ACT CAA TGT GAC TAC AGA GGC AAA TTT AAC CCT GAT AAG TGT GTC ACT 2242

691 Ser Lys Tyr Glu Asn Cys Val Thr Gln Cys Asp Tyr Arg Gly Lys Phe Asn Pro Asp Lys Cys Val Thr 713

2243 GGC TGT GGA CAA CCT ACA CAG AGA TGG TAT CAT GTG CCA CGA TCT TGG TTC AAG CCA TCA GGA AAT GTC 2311

714 Gly Cys Gly Gln Pro Thr Gln Arg Trp Tyr His Val Pro Arg Ser Trp Phe Lys Pro Ser Gly Asn Val 736

2312 TTA ATT ATC TTT GAG GAA ATA GGT GGA GAT CCC TCT CAA ATT AGA TTC TCA ATG CGA AAG GTT TCT GGA 2380

737 Leu Ile Ile Phe Glu Glu Ile Gly Gly Asp Pro Ser Gln Ile Arg Phe Ser Met Arg Lys Val Ser Gly 759

2381 GCT TGT GGT CAT CTT TCA GTG GAC CAT CCA TCC TTT GAT GTT GAA AAT CTG CAA GGA AGT GAA ATT GAG 2449

760 Ala Cys Gly His Leu Ser Val Asp His Pro Ser Phe Asp Val Glu Asn Leu Gln Gly Ser Glu Ile Glu 782

2450 AAC GAC AAA AAC AGG CCA ACT CTA AGT TTG AAA TGC CCC ACA AAT ACT AAT ATT TCC TCT GTC AAA TTT 2518

783 Asn Asp Lys Asn Arg Pro Thr Leu Ser Leu Lys Cys Pro Thr Asn Thr Asn Ile Ser Ser Val Lys Phe 805

2519 GCC AGC TTT GGA AAT CCT AAT GGT ACA TGT GGC TCC TAC ATG CTA GGA GAC TGC CAC GAT CAG AAT TCT 2587 

806 Ala Ser Phe Gly Asn Pro Asn Gly Thr Cys Gly Ser Tyr Met Leu Gly Asp Cys His Asp Gin Asn Ser 828 

2588 GCA GCA CTG GTC GAA AAG GTT TGC CTG AAC CAA AAT GAG TGT GCA TTA GAA ATG TCC AGC GCA AAC TTT 2656

829 Ala Ala Leu Val Glu Lys Val Cys Leu Asn Gln Asn Glu Cys Ala Leu Glu Met Ser Ser Ala Asn Phe 851

2657 AAC ATG CAA TTG TGT CCA AGT ACA GTA AAG AAA CTT GCA GTT GAA GTG AAT TGC AGC TGA GTGTCATTGCCC 2728 

852 Asn Met Gin Leu Cys Pro Ser Thr Val Lys Lys Leu Ala Val Glu Val Asn Cys Ser *** 871 

2729 AAAATGAATGACATATTCTAATTTATATAG^ 2820 

2821 CTTTCGTCTATGATTTGAAGTTTAA 2912 

2913 GTATTCATATTTATTCAGTCGAGATGCAAGA 2972 
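The paired rows in the sequence figures above can be cross-checked mechanically: each codon should translate to the residue printed beneath it, and a row of n codons that starts at coordinate s should end at s + 3n - 1. The following is a minimal Python sketch of such a check. It is not part of the original disclosure; the helper names `translate_row` and `row_end` are hypothetical, and the table is the standard genetic code.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codons are printed as *** in the figures
}
CODON_TABLE = {
    "".join(codon): ONE_TO_THREE[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_row(codons):
    """Translate a whitespace-separated row of codons to three-letter residues."""
    return [CODON_TABLE[c.upper()] for c in codons.split()]


def row_end(start, codons):
    """Coordinate of the last nucleotide in a row whose first base is `start`."""
    return start + 3 * len(codons.split()) - 1


# One row of the TBG7 figure, beginning at nucleotide 2036.
row = ("CAG CCC CTC ACA TGG TAT AAG GCA GTA GTA GAT GCG "
       "CCT CCT GGT AAT GAA CCT GTT GCA CTT GAT ATG")
print(" ".join(translate_row(row)))
print(row_end(2036, row))
```

A codon that is not in the table (for example one containing an OCR-damaged character) raises `KeyError`, which makes damaged rows easy to locate.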
Figure 4. Autoradiograph of northern blot analysis of TBG expression in various
plant tissues. Twenty µg of total RNA extracted from flowers (F), leaves (L), roots (R)
and stems (S) was loaded in each lane. RNAs were separated in an agarose gel and 
transferred to nylon membrane. Blots were hybridized using the probes indicated to the 
right, washed to a final stringency of 0.1X SSC at 65°C and were used to expose x-ray 
film. A 26S ribosomal gene clone from soybean was used as a loading control for each 
blot and one example is shown. 
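The wash stringency quoted in the figure captions, 0.1X SSC, can be converted to molar salt concentrations from the conventional SSC recipe, in which 1X SSC is 150 mM sodium chloride and 15 mM sodium citrate. The short Python sketch below does that arithmetic. The function name is hypothetical, and the recipe is the conventional one assumed here; the captions do not spell it out.

```python
# Conventional 1X SSC composition in millimolar (an assumption; not stated in the captions).
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_concentrations(fold):
    """Return the millimolar salt concentrations of an SSC solution at `fold` X."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return {salt: conc * fold for salt, conc in SSC_1X_MM.items()}


for salt, millimolar in ssc_concentrations(0.1).items():
    print(f"{salt}: {millimolar:.1f} mM")
```

Lower salt at a fixed temperature destabilizes mismatched hybrids, which is why a 0.1X wash at 65°C counts as a high-stringency condition.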
Figure 5. Autoradiograph of northern blot analysis of TBG expression in fruit
tissues. Twenty µg of total RNA extracted from peel and outer pericarp tissue was
loaded in each lane. Fruit were harvested at 10, 20, 30, 35, and 40 days post-pollination
and at the breaker (Br), turning (Tr), pink (Pk), red (Rd) and overripe (OR)
stages. RNAs were separated in an agarose gel and transferred to nylon membrane.
Blots were hybridized using the probes indicated to the right, washed to a final
stringency of 0.1X SSC at 65°C and were used to expose x-ray film. A 26S ribosomal gene
clone from soybean was used as a loading control for each blot and one example is 
shown. A cDNA clone for tomato polygalacturonase (PG) was also used as a probe to 
show a well characterized, fruit-ripening-specific control. 
Figure 6. Autoradiograph of northern blot analysis of TBG expression in fruit
tissues. Twenty µg of total RNA extracted from mature green or turning stage fruit peel
(P), outer pericarp (OP), inner pericarp (IP) and locular (L) tissue was loaded in each
lane. RNAs were separated in an agarose gel and transferred to nylon membrane. Blots
were hybridized using the probes indicated to the right, washed to a final stringency of
0.1X SSC at 65°C and were used to expose x-ray film. A 26S ribosomal gene clone
from soybean was used as a loading control for each blot and one example is shown. A 
cDNA clone for tomato polygalacturonase (PG) was also used as a probe to show a 
well characterized, fruit-ripening-specific control. 
Figure 7. Autoradiograph of northern blot analysis of TBG expression in normal
and mutant fruit tissues. Twenty µg of total RNA extracted from peel and outer
pericarp tissue at various days post-pollination (dpp) was loaded in each lane. RNAs were
separated in an agarose gel and transferred to nylon membrane. Blots were hybridized
using the probes indicated to the right, washed to a final stringency of 0.1X SSC at
65°C and were used to expose x-ray film. A 26S ribosomal gene clone from soybean
was used as a loading control for each blot and one example is shown. A cDNA clone
for tomato polygalacturonase (PG) was also used as a probe to show a well
characterized, fruit-ripening-specific control. The - and > marks on the left indicate the position of
the tomato 27S and 18S rRNAs respectively.
Figure 8. Autoradiograph of northern blot analysis of TBG expression in response
to ethylene treatment of mature green fruit tissues. Twenty µg of total RNA
extracted from peel and outer pericarp tissue at various times (1, 2, 12 and 24 hours) after
treatment with (+) or without (-) 10 ppm ethylene was loaded in each lane. RNAs were
separated in an agarose gel and transferred to nylon membrane. Blots were hybridized
using the probes indicated to the right, washed to a final stringency of 0.1X SSC at
65°C and were used to expose x-ray film. A 26S ribosomal gene clone from soybean
was used as a loading control for each blot and one example is shown. The - marks on 
the left indicate the position of the tomato 27S rRNA. 
Figure 9. Western blot analysis of TBG4 expression by yeast. A yeast clone was
isolated that secreted high levels of FLAG-TBG4 fusion protein into the culture medium.
Protein samples were separated in an 8% acrylamide gel, transferred to nitrocellulose
and were blotted with M1 anti-FLAG primary antibody. Blots were washed and blotted
with an alkaline-phosphatase conjugated secondary antibody and alkaline phosphatase
activity was detected using Sigma Fast substrate. Lane 1, culture medium of an
untransformed yeast clone, used as a negative control. Lane 2, culture medium of
yeast clone expressing FLAG-TBG4 fusion protein. Lane 3, affinity-purified FLAG-TBG4
fusion protein. 
Figure 12. Northern blot analysis of TBG4 expression in transgenic fruit containing
TBG4 antisense construct. A. Total RNA was extracted from mature green/42
days post-pollination (MG), turning/breaker + 3 (TR) and red/breaker + 7 (RD) fruit and
twenty µg was loaded in each lane. RNAs were separated in an agarose gel and
transferred to nylon membrane. Blots were hybridized using the probes indicated to the right,
washed to a final stringency of 0.1X SSC at 65°C and were used to expose x-ray film.
A 26S ribosomal gene clone from soybean was used as a loading control. The marks -
and { denote the positions of the endogenous TBG4 and antisense mRNAs
respectively. Lanes 2-8 correspond to transgenic lines 2-8 in Figures 11A-E. B. Chart of TBG4
mRNA levels in lines 2-8. Autoradiographs were scanned using a densitometer and 
TBG4 mRNA levels were corrected against the loading controls. TBG4 mRNA levels 
are shown in arbitrary units. 
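The loading-control correction described for panel B amounts to dividing each TBG4 band intensity by the 26S rRNA band intensity measured in the same lane. A minimal Python sketch of that calculation follows. The function name and the numbers in the usage lines are hypothetical placeholders, not data from the figure, and the optional rescaling to a reference lane is an assumption rather than something the caption states.

```python
def normalize_to_loading_control(target, control, reference_lane=None):
    """Divide each target band intensity by its lane's loading-control intensity.

    target, control: sequences of densitometer readings, one per lane.
    reference_lane: optional lane index whose ratio is rescaled to 1.0.
    Returns a list of ratios in arbitrary units.
    """
    if len(target) != len(control):
        raise ValueError("target and control must have one value per lane")
    ratios = []
    for t, c in zip(target, control):
        if c <= 0:
            raise ValueError("loading-control intensity must be positive")
        ratios.append(t / c)
    if reference_lane is not None:
        ref = ratios[reference_lane]
        ratios = [r / ref for r in ratios]
    return ratios


# Hypothetical readings for three lanes (arbitrary densitometer units).
tbg4_signal = [10.0, 5.0, 8.0]
rrna_signal = [2.0, 2.0, 4.0]
print(normalize_to_loading_control(tbg4_signal, rrna_signal))
print(normalize_to_loading_control(tbg4_signal, rrna_signal, reference_lane=0))
```

Dividing by the control signal removes lane-to-lane differences in the amount of RNA loaded, so the remaining variation reflects transcript abundance.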
SEQUENCE LISTING

<110> U.S. Department of Agriculture

<120> Genes Coding For Tomato Beta-Galactosidase Polypeptides

<130> 70608/125650

<140> PCT/US99/12697
<141> 1999-06-08

<150> 60/088,805
<151> 1998-06-09

<160> 14

<170> PatentIn Ver. 2.1

<210> 1
<211> 3224
<212> DNA
<213> Lycopersicon esculentum
<400> 1

ttttttcttt gttcttttgc tcagcactag agcctagaag aaggaaaaaa agaagtatgg 60 
actaatggaa taaacataaa aaagagagaa aaaaaaaaaa gaagaaaatt cttcagacaa 120 
caaaaacagc tgttttccct tcactacttt ttttttccca atctctatat aattgcaaga 180 
atagaataaa gtttgcaact tgattaaaaa aaaagaataa taagctgtgg gggtagggag 240 
gaagttagtt cattagttca ttgccttgta aaggcacaat cttgattctt gatttgttga 300 
caaatatggg tttttggatg gcaatgttgc tgatgttgtt attgtgttta tgggtttctt 360 
gtggaattgc ttctgtttca tatgaccata aagctatcat tgtaaatgga caaagaaaaa 420 
ttctcatttc tggatccatt cactacccta gaagcacccc tgagatgtgg ccagatctta 480 
ttcagaaggc aaaagaaggg ggagttgatg ttatacagac ttatgttttc tggaatgggc 540 
atgagcctga agaagggaaa tattattttg aagagaggta tgatttagtg aagttcatta 600 
aagtggtgca agaagcagga ctttatgtgc atcttaggat tggaccttat gcatgtgctg 660 
aatggaattt tgggggtttt cctgtttggc tgaagtatgt tccaggtatt agtttcagaa 720 
caaacaatga gccattcaag gctgcaatgc aaaagttcac tactaagatt gttgatatga 780 
tgaaagcaga aaagctctat gaaactcagg gtggtccaat tattctatct cagatagaaa 840 
atgaatatgg acctatggag tgggaactag gtgaacctgg taaagtttac tcagaatggg 900 
cagccaaaat ggctgtggat cttggcactg gtgtcccatg gatcatgtgc aagcaagatg 960 
atgtccctga tcctattatt aatacttgca atggtttcta ctgtgactac ttcacaccaa 1020 
ataaggctaa taaacccaag atgtggactg aagcctggac tgcctggttt accgaatttg 1080 
gaggtccagt tccttaccgt cctgcagagg atatggcatt tgctgtcgca agatttatac 1140
aaacgggagg ctccttcatc aattactata tgtatcatgg aggaacaaac tttggaagga 1200 
cttctggtgg cccatttatt gctactagtt atgattatga tgcaccccta gatgaatttg 1260 
ggtcattacg gcagcctaaa tggggtcatc tgaaagatct acatagagca ataaagctct 1320 



gtgagccagc tttagtatct gtagatccaa ctgtgacatc cttaggaaac tatcaagagg 1380 
cacgtgtttt caagtcagag tctggggcct gcgctgcctt cctagcaaat tacaaccagc 1440 
actcttttgc taaagtggca tttgggaaca tgcattataa cttgccaccc tggtctatca 1500 
gcattcttcc cgactgcaag aacactgtct ataatactgc aagggttggt gctcaaagtg 1560 
ctcagatgaa gatgactcca gtcagtagag gattctcatg ggagtcattc aatgaagacg 1620 
cagcatcgac tgaagacgac actttcacag ttgttgggtt attggagcag attaatatca 1680 
caagagatgt atctgattac ttgtggtata tgactgacat tgagattgat ccaacagaag 1740 
gatttttgaa tagtggaaat tggccttggc ttactgtctt ttctgctggc catgcattgc 1800 
atgtattcgt gaatggtcaa ttagcaggaa ctgtgtacgg aagtttagaa aacccaaaac 1860 
taactttcag caacggtata aatctgagag ctggtgtgaa caagatttct ctgctaagca 1920 
ttgctgttgg tcttccgaac gttggccctc attttgagac atggaatgct ggtgttcttg 1980 
gaccagtttc acttaatgsa cttaatgaag gaacaagaga tttaacatgg cagaaatggt 2040 
tctacaaggt tggtctaaaa ggagaagccc tgagtcttca ttcactcagt ggtagcccat 2100 
ccgtggagtg ggtggaaggc tctttagtgg ctcagaagca gccactcagt tggtataaga 2160 
ctacattcaa tgctccagat ggaaatgaac ctttggcttt agatatgaat accatgggca 2220 
aaggtcaagt atggataaat ggtcagagcc tcggacgcca ctggcctgca tataaatcat 2280 
ctggaagttg tagtgtctgt aactatactg gctggtttga tgagaaaaag tgcctaacta 2340 
actgtggtga gggctcacaa agatggtacc acgtaccccg gtcttggctg tatcctactg 2400 
gaaatttgtt agttgtattc gaggaatggg gaggagatcc ttatggaatc actttagtca 2460 
aaagagaaat agggagtgtt tgtgctgata tatatcagtg gcaaccacag ttattgaatt 2520 
ggcagaggct agtatctggt aagtttgaca gacctctcag acctaaagcc catcttaagt 2580 
gtgcacctgg tcagaagatt tcttcaatca aatttgcaag ctttggaaca ccagagggag 2640 
tttgtgggaa cttccagcag ggaagctgcc atgctccgcg ctcatatgat gctttcaaaa 2700 
agaattgtgt tgggaaagag tcttgctcag tacaggtaac accagagaat tttggaggtg 2760 
atccatgtcg aaacgttcta aagaaactct cagtggaagc catttgtagt tgatgattct 2820 
gagtatacaa gtgaaaaaat acttgaacca ctcatataaa catttttcaa acgagctact 2880 
agacatccat taacccacac taccattttt tggctttgct ggggttgaag ttgtacagtt 2940 
aagcaacaca cctctttgat caaagctcac ctgattatga agatgattga cgaaagattc 3000 
tgtacatgta aggtttcgtc taattacaca tacagatatg attcttgatg aatcgatgtg 3060 
caaattttgt ttgtgttagg gtgagagaga cttgaaaagc attttgcttt catgatgttc 3120
tacattatac aatcataatg taagtaagca agcaataatt cattgctttg cacattgaaa 3180
aatgcatttt actatgttgc agtacaaaaa aaaaaaaaaa aaaa 3224 
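An entry in this listing can be checked against its declared <211> length by stripping the grouping spaces and comparing the running counter at the end of each line with the number of bases read so far. The sketch below assumes the layout used above (lowercase bases in groups of ten, cumulative count at the right margin). `parse_listing_block` is a hypothetical helper, and the three sample lines are illustrative, not a sequence from this application.

```python
import re


def parse_listing_block(lines):
    """Parse sequence-listing lines such as 'acgtacgtac gtacgtacgt 20'.

    Returns (sequence, mismatches), where mismatches is a list of
    (line_number, printed_count, actual_count) for lines whose running
    counter disagrees with the number of bases read so far.
    """
    parts = []
    mismatches = []
    total = 0
    for number, line in enumerate(lines, start=1):
        bases = "".join(re.findall(r"[a-z]", line.lower()))
        # Join all digits so a counter split by a stray space ("1 140") still parses.
        digits = "".join(re.findall(r"\d", line))
        if not bases:
            continue
        total += len(bases)
        parts.append(bases)
        if digits and int(digits) != total:
            mismatches.append((number, int(digits), total))
    return "".join(parts), mismatches


sample = [
    "acgtacgtac gtacgtacgt 20",
    "acgtacgtac gtacgtacgt 4 0",  # counter split by a stray space
    "acgt 44",
]
sequence, problems = parse_listing_block(sample)
print(len(sequence), problems)
```

Comparing `len(sequence)` with the <211> value then flags entries in which bases were lost or duplicated during transcription.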

<210> 2
<211> 2984
<212> DNA
<213> Lycopersicon esculentum
<400> 2

ggagcagaag aaaaacactg aattttccgt taatactaac ggtgttaact atccactttg 60 
tgatcgtcgc cggcgagtat ttcaagccgt tcaatgtcac ctacgataac cgagctctca 120 
tcatcggcgg taaacgccgt atgcttatct ccgccggaat tcactaccct cgcgccactc 180 
ctgagatgtg gcccacattg atagctagga gcaaagaagg tggtgcagat gtcatcgaga 240 
cttatacatt ttggaatggt catgagccaa ccaggggaca gtacaatttt gaaggaagat 300 
atgatattgt caagttcgca aagctagtcg gatctcatgg actgttcctc tttattcgaa 360 



taggtcctta tgcctgtgca gaatggaact tcgggggatt ccccatatgg cttcgtgata 420 
tacctggaat agaatttcga acagataatg caccattcaa ggaggagatg gagcgctatg 480 
ttaaaaagat agttgatctt atgatatctg agtcgctctt ttcgtggcaa ggtggtccta 540 
tcattttgct gcagattgaa aatgaatatg gaaatgttga aagctcattc ggtcccaagg 600 
ggaagttata tatgaaatgg gctgctgaaa tggctgttgg tcttggtgct ggtgttccat 660 
gggtcatgtg caggcaaact gatgctccag aatacatcat agatacttgt aatgcatact 720 
attgtgatgg gttcacgccg aattccgaga agaaaccgaa aatttggact gagaattgga 780 
atggatggtt tgcagattgg ggtgaaagac ttccatatag accttccgag gatattgcat 840 
ttgcaattgc tcgtttcttt caacgtgggg gcagcttaca gaactattat atgtattttg 900 
gtgggacaaa ttttggccgg actgctggtg gcccaactca aatcactagc tatgattatg 960 
atgctccact ggatgaatat ggactactac gtcaacctaa atggggccat ttgaaggatc 1020 
tgcatgctgc tataaagctt tgtgaaccag ctcttgttgc tgctgattca cctcagtata 1080 
ttaaactggg accaaaacag gaggcacatg tctatcgtgg aacatccaac aacattggcc 1140
aatatatgtc cttaaatgaa ggcatatgcg cagcatttat tgcaaatatt gatgaacatg 1200 
aatcagcaac agtgaaattt tacggtcaag agttcacttt acctccatgg tcagtggtat 1260 
tctgccagat tgcagaaata cagctttcaa cacagctaag gtgggggcac aaacttcaat 1320 
caaaacagtg ggctcagatt ctgtttcagt tgggaataat tctttctttc tacaagttat 1380 
cactaaaagc aagctcggaa agtttttcac aatcttggat gacattgaag gagccacttg 1440 
gtgtgtgggg tgacaagaat ttcacttcta aaggaatact ggagcatctg aatgtgacaa 1500 
aagaccagtc tgattacctg tggtatctga ccaggatata tatttctgat gatgacatct 1560 
cattttggga ggaaaatgat gttagtccaa caattgatat tgatagcatg cgtgattttg 1620 
ttcgcatttt tgttaatggg cagcttgcag gtagtgtgaa aggcaaatgg atcaaggtgg 1680 
ttcaacctgt taagctggtt cagggataca acgacatact gctattatct gagacggtgg 1740 
gattgcagaa ttatggtgcc ttcttggaga aggatggggc aggttttaaa ggtcagataa 1800 
agcttacagg atgcaaaagc ggggatatca atctcacaac atctttatgg acctaccagg 1860 
tggggcttag aggcgaattc ctggaagtat atgatgtcaa tagtactgaa agtgcaggat 1920 
ggactgagtt tcccactggt acaactccgt cagtcttttc gtggtacaag acaaagtttg 1980 
atgccccagg cgggacagat ccagttgctc ttgattttag tagcatggga aaaggtcagg 2040 
catgggttaa tggccaccat gtaggaagat attggacttt ggttgcacca aataatggat 2100 
gtggaagaac ttgtgattat cgtggtgctt accactctga taaatgtagg acaaactgtg 2160 
gagagattac tcaggcctgg taccatatac ctagatcatg gctaaagaca ttaaataatg 2220 
tactagttat ctttgaagaa acagataaaa ctccgtttga tatttccatt tctacgcgtt 2280 
ctactgaaac catttgtgct caagtatcgg aaaagcacta tccacctcta cataagtggt 2340 
ctcattcgga gtttgacaga aagttgtctc tgatggataa aacaccagaa atgcacttgc 2400 
agtgtgacga aggacataca atctcttcta ttgaatttgc aagctatgga agtccgaatg 2460 
gcagctgtca aaagttctca caaggaaaat gccatgctgc aaattccttg tctgttgtat 2520 
ctcaggcttg tataggaaga actagttgca gcattggcat ttccaatggt gtatttggag 2580 
atccatgtcg acacgttgtg aagagtttgg ctgttcaagc aaaatgctca ccaccaccag 2640 
acctcagcac ttcagcttcc tcgtgaggag actctggtaa cacgttaacc ttttagaacg 2700 
aaacgatccc ttaaagtcca ctcgttcccc tgcccccgag ccctctgcta catttctcag 2760 
atcgcatcgt tacaatcagg cggagaaaac gtacatggac gattttactt gtaaatattt 2820 
ggttactgta tataaaatga aaggaataat gttgcttatg catatgagct gcaaattata 2880 
tgacaaagta acaaatgaaa atagaaaact cctgtctgtc aaagaatttt aacaacacca 2940 
tttattaaaa gttagttaac atgattaaaa aaaaaaaaaa aaaa 2984 



<210> 3
<211> 3069
<212> DNA
<213> Lycopersicon esculentum
<400> 3

agagttcatt attttttttg cattttgaaa aagaggaaaa aaataaagtt aaaggggggg 60 
gaaaaagttt tcattttgcc ttaaaaaggc acaatcttga tagaaaagga gataatttta 120 
catgggttgt acgcttatac taatgttgaa tgtgttgttg gtgttgttgg gttcatgggt 180 
tttttctgga acagcttctg tttcatatga ccatagggct attattgtaa atggacaaag 240 
aagaatactt atttctggtt ctgttcatta tccaagaagc actcctgaga tgtggccagg 300 
tattattcaa aaggctaaag aaggaggtgt ggatgtgatt cagacttatg ttttctggaa 360 
tggacatgag cctcaacaag ggaaatatta ttttgaaggg agatatgatt tagtgaagtt 420 
tattaagctg gtgcaccaag caggacttta tgtccatctt agagttggac cttatgcttg 480 
tgctgaatgg aattttgggg gctttcctgt ttggctgaaa tatgttccag gtatcagttt 540 
cagaacagat aatggacctt tcaaggctgc aatgcaaaaa tttactgcca agattgtcaa 600 
tatgatgaaa gcggaacgtt tgtatgaaac tcaagggggg ccaataattt tatctcagat 660 
tgagaatgaa tatggaccca tggaatggga actgggagca ccaggtaaat cttacgcaca 720 
gtgggccgcc aaaatggctg tgggtcttga cactggtgtc ccatgggtta tgtgcaagca 780 
agacgatgcc cctgatccta ttataaatgc ttgcaatggc ttctactgtg actacttttc 840 
tccaaacaag gcttataaac caaagatatg gactgaagcc tggactgcat ggtttactgg 900 
ttttggaaat ccagttcctt accgtcctgc tgaggacttg gcattttctg ttgcaaaatt 960 
tatacagaag ggaggttcct tcatcaatta ttacatgtat catggaggaa caaactttgg 1020 
acggactgct ggtggtccat ttattgctac tagttatgac tatgatgcac cacttgatga 1080 
atatggatta ttgcgacaac caaaatgggg tcacctgaaa gatctgcata gagcaataaa 1140
gctttgtgaa ccagctttag tctctggaga tccagctgtg acagcacttg gacaccagca 1200 
ggaggcccat gtttttaggt cgaaggctgg ctcttgtgct gcattccttg ctaactacga 1260 
ccaacactct tttgctactg tgtcatttgc aaacaggcat tacaacttgc catggtggtc 1320 
aatcagcatt cttcccgact gcaagaacac tgtatttaat acagcacgga tcggtgctca 1380 
aagtgctcag atgaagatga ctccagtcag cagaggattg ccctggcagt cattcaatga 1440 
agagacatca tcttatgaag acagtagttt tacagttgtt gggctattgg aacagataaa 1500 
tacaacaaga gacgtgtctg attatttgtg gtattcaaca gatgtcaaga ttgattcaag 1560 
agaaaagttt ttgagaggcg gaaaatggcc ttggcttacg atcatgtcag ctgggcatgc 1620 
attgcatgtt tttgtgaatg gtcaattagc aggaactgca tatggaagtt tagaaaaacc 1680 
gaaactaact ttcagtaaag ccgtaaatct gagagcaggt gttaacaaga tttctctact 1740 
gagcattgct gttggccttc cgaatatcgg cccacatttt gagacatgga atgctggtgt 1800 
tcttgggcca gtctcactaa ctggtcttga cgaggggaaa agagatttaa catggcagaa 1860 
atggtcttac aaggttggtc taaaaggaga agccttgagc ctccattcac tcagtggtag 1920 
ctcgtcagtt gagtgggtcg agggttcttt agtggctcag agacagccac tcacatggta 1980 
caagagcact tttaatgctc cagctggaaa tgatcctttg gctttagact tgaataccat 2040 
gggcaaagga caagtgtgga taaatggtca aagcctcgga cgctattggc ctggatataa 2100 
agcatctggt aactgcggtg cctgtaacta tgcaggctgg tttaatgaga aaaaatgcct 2160 
aagtaactgt ggagaggctt cacaacgatg gtatcatgtt ccccgttctt ggctgtatcc 2220 
tactggaaat ttgttagttc tatttgagga atggggagga gagcctcatg gaatctcttt 2280 
ggtaaaaaga gaagttgcaa gtgtttgtgc agatataaac gaatggcaac cacagttggt 2340 
gaattggcaa atgcaagcat ctggtaaagt tgacaaacca ctgagaccta aagctcacct 2400 




ctcgtgtgct tctggtcaga agattacttc aatcaaattt gcaagctttg gaccaccaca 2460 
aggggtctgc ggaagcttcc gtgaaggaag ctgccacgcc ttccactcat atgatgcttt 2520 
tgaaaggtat tgcatcgggc aaaactcgtg ctcagtacct gtaacaccag agatctttgg 2580 
aggtgatcca tgtccacatg ttatgaagaa actctcagtt gaggttattt gcagttgatg 2640 
acactgagga gaaacaaata aaagtggttt cagttagttg tctgaacata tcaaaaagtt 2700 
ggctttgatg gaggtgaagt tgtacagata tgcaacacac ctttccattt gaggcacata 2760 
tgaattgcaa tggcccaaga ttctgtacat atatgttggt tactgtcaag ttggtattgg 2820 
tttgcaaatg taaaacagta gtatagtcat tggttcaagt gcgcatcgag attgtgctag 2880 
tgggaggtag taggtaccga tcgatctatc gttgtttgca caagctgggc ctagattcca 2940 
ctattattat aacaaagaaa gcacaatgag actgattctt gattagtcca tgtgtagata 3000 
ttgttactgt tggatttgca aatcttgtga tttcagcaaa aaaaaaaaaa aaaaaaaaaa 3060 
aaaaaaaaa 3069 



<210> 4
<211> 2554
<212> DNA
<213> Lycopersicon esculentum
<400> 4

aaaaaaagtt tcaatttttt ttctaaaata aaaaaaaatt catttttttt gaatgtggaa 60 
aaaatgctaa ggactaatgt gttgttgtta ttagttattt gtttattgga ttttttttct 120 
tcagtgaaag ctagtgtttc ttatgatgac agagctataa tcataaatgg gaaaagaaaa 180 
attcttattt ctggttcaat tcattatcca agaagcactc cacagatgtg gcctgatctt 240 
atacaaaagg ctaaagatgg aggcttagat gttattgaaa cttatgtttt ctggaatgga 300 
catgagcctt ctcctggaaa atataatttt gaaggaagat atgatcttgt tagattcatc 360 
aaaatggtac aaagagcagg actttatgtc aatttacgta ttggccctta cgtctgtgct 420 
gaatggaact ttgggggatt ccctgtttgg ctaaaatatg tgcctggtat ggaatttaga 480 
acaaacaatc agccttttaa ggtggctatg caaggatttg ttcagaaaat agtcaacatg 540 
atgaagtcag aaaatttgtt tgaatctcaa ggaggaccaa taattatggc ccagatacaa 600 
aatgagtatg gaccagtaga atgggaaatt ggtgctcctg gtaaagctta tacaaaatgg 660 
gcagctcaaa tggctgtagg tttgaaaact ggtgtcccat ggatcatgtg taagcaagag 720 
gatgctcctg atcctgtgat tgatacttgt aatggcttct actgcgaagg gttccgtcct 780 
aataagcctt acaaacctaa aatgtggaca gaagtatgga ctggctggta tacgaaattc 840 
ggtggtccaa ttcctcaaag accagccgaa gacattgcat tttcagttgc caggtttgtt 900 
cagaacaatg gttcattctt caattactac atgtatcatg gaggaacaaa ttttggccgg 960 
acatcatcag ggcttttcat tgcaactagc tacgattatg atgctcctct cgatgaatat 1020 
gggttgctga atgaaccaaa gtatgggcac ttgagagact tacataaagc tatcaagcta 1080 
tctgaaccgg ctttagtttc atcatatgct gcggtgacta gtcttggaag taatcaagag 1140
gctcatgttt atagatcaaa atctggagct tgtgctgctt ttttatccaa ctatgactct 1200 
agatattcag taaaagtcac ctttcagaat aggccataca atctgcctcc atggtccatc 1260 
agcattcttc ccgactgcaa aactgccgtt tacaacactg cacaggttaa ctctcaaagc 1320 
tcgagcataa agatgacgcc tgcaggtggt ggattctctt ggcagtcata caatgaagaa 1380 
acgcctactg ctgatgacag cgatacactt acagctaacg gactatggga acagaaaaac 1440 
gtcacaagag attcatcaga ctatctgtgg tacatgacaa atgtaaatat agcatctaat 1500 
gaaggatttc taaagaacgg aaaggatcct tatctcactg ttatgtccgc tggtcatgtc 1560 



ttgcatgttt tcgtcaatgg aaaactatca ggaactgttt atggtacatt ggataatcca 1620 
aaacttacat acagtggcaa cgtgaagtta agagctggta ttaacaagat ttctctgctc 1680 
agtgtttccg ttggtctccc gaacgttggc gtgcattatg atacatggaa tgcaggagtt 1740 
ctaggtccag tcacgttgag cggtctcaat gaagggtcaa gaaacttggc gaaacagaaa 1800 
tggtcttaca aggttggtct gaaaggcgaa tcgttaagtc ttcactcctt aagtgggagt 1860 
tcttctgttg aatgggttcg aggttcacta atggctcaaa agcagcccct gacttggtac 1920 
aaggctacat ttaacgcgcc tggaggaaat gatccactag ctttagacat ggcaagtatg 1980 
ggaaaaggtc agatatggat aaatggtgaa ggcgtaggtc gccattggcc tggatacata 2040 
gcacaaggcg actgcagcaa atgcagttat gctggaacgt tcaacgagaa gaagtgcgga 2100 
actaactgcg gacaaccttc tcagagatgg taccatgttc cacgatcgtg gctgaaacca 2160 
agtggaaact tgttagtagt attcgaagaa tggggaggta atccaacagg aatttctcta 2220 
gtcaggagat caagataaag aactcgaaaa gtaaaacttg ttcagtaact atggtgcttg 2280 
aattcgcgcc gaaaaataca tacacgaagc taacaatgga ggctacagtt tgcaaattgc 2340 
agctgaataa aacattagaa gataaagaaa tatttgatta aaaggagtat ataaatttac 2400 
agagaatttt ctttattctt tgtaaaactt tggtttataa agtttataca gaattttctg 2460 
ttatttggat tatgagattg aagaagattg tacagcttcc aaatactatt agaatacaaa 2520 
taaatttcat gtaaaaaaaa aaaaaaaaaa aaaa 2554 



<210> 5
<211> 755
<212> DNA
<213> Lycopersicon esculentum
<400> 5

atccagactt acgttttctg gaaccttcat gaacctgttc gaaatcagta tgattttgaa 60 
ggaaggaaag atttgattaa ttttgtgaag ttggtggaga gagctggctt atttgttcat 120 
ataaggattg ggccttatgt ttgtgcagaa tggaactatg gtgggtttcc tctttggttg 180 
catttcattc ctggaattga atttcgaacc gacaatgaac cgttcaaggc agaaatgaag 240 
cgattcacag ctaaaattgt tgacatgatc aagcaagaaa atctatatgc atcccagggt 300 
gggccggtta tcttgtctca gatagaaaat gagtatggca atggtgatat tgagtctcgt 360 
tatggtcctc gtgccaaacc ttacgtgaac tgggcagcat caatggctac gtctttaaat 420 
acgggagtgc catgggttat gtgtcagcaa ccagatgccc ctccttccgt tattaacact 480 
tgcaatggat tttattgtga ccaattcaag caaaattccg ataaaacacc caagatgtgg 540 
actgagaatt ggaccggatg gtttctgtcg tttggtggtc ctgtccctta cagaccagtg 600 
gaagacatcg ctttcgctgt ggctcgattt ttccagcgag gcggaacttt ccagaactat 660 
tacatgtacc acgggggaac taactttggg agaaccagtg gtggaccgtt tattgcaact 720 
agctatgact atgatgcccc tctcgacgaa tacgg 755 



<210> 6
<211> 749
<212> DNA
<213> Lycopersicon esculentum
<400> 6



atccagacat atgttttttg gaatgttcat gagccttctc ctggcaatta caattttgaa 60 
ggaagatatg acctggtgag gtttgtaaaa acgattcaga aagcagggct gtatgctcat 120 
cttcgaattg gcccttacgt ttgtgcagag tggaattttg gagggtttcc agtatggctg 180 
aagtatgtac ctggcattag cttcagagct gataatgaac ctttcaagaa cgcaatgaaa 240 
gggtatgctg agaaaattgt taacttgatg aagatcataa tcttttcgag tctcagggtg 300 
gtccaatcat actctcacag attgagaatg agtatgggcc tcaagccaag gtacttggag 360 
caccgggaca tcagtattca acatgggctg caaatatggc agttggattt gaacacaggc 420 
gtcccatggg tgatgtgcaa ggaagaagat gcaccagatc ctgtgatcaa cacatgcaat 480 
ggtttctact gtgataattt cttcccaaac aaaccataca aacctgcaat ttggactgaa 540
gcttggagtg gatggttctc ggaatttggc ggtccccttc atcagagacc agttcaggat 600 
ttggcatttg ctgttgccca atttatacaa agaggaggat cttttgttaa ctattacatg 660 
tacatggggg gcacgaactt tggacgcact gcgggtgggc cattcatcac taccagctat 720 
gattatgatg cccccctcga cgagtatgg 749 



<210> 7
<211> 2972
<212> DNA
<213> Lycopersicon esculentum
<400> 7

gcaacttctc cggtgaataa caccggtaaa cggccaatgc caactctcgt cggaatctga 60 
atagtgattt aagcagctta gctagctaac ttttgcctct gcaatgaaca caatgagttg 120 
tttgtcctct aatttcaagt tcgttttcct tgcctcgact gtgatatgga tgacggtaat 180 
gtcgtcgtcg ttagcagcag tagatgcttc caatgttact actattcgta ctgatagtgt 240 
gacttacgat cgacgctcgt tgattattaa cggccagagg aagctgctca tctccgcttc 300 
cattcactat cctcgcagtg tccctgccat gtggcctggt ctggttcgat tggcgaagga 360 
aggaggagtg gatgttattg aaacgtatgt tttctggaac ggtcacgaac cttctccggg 420 
caattattac tttggaggaa ggtttgatct agtcaaattt tgtaagatca ttcagcaggc 480 
tggaatgtat atgattcttc ggattggacc atttgtagct gcagaatgga actttggtgg 540 
acttcctgtg tggttgcatt atgtgccagg taccaccttt cggactgata gtgaaccatt 600 
taagtatcac atgcagaagt tcatgacata tacagtgaac ttaatgaaga gagagaggct 660 
ttttgcatct caaggaggtc caatcatctt gtcacaggta gaaaatgagt acggctacta 720 
tgaaaatgca tatggagaag gagggaaaag gtatgcctta tgggctgcta aaatggccct 780 
ttctcaaaat actggtgtac cttggataat gtgccagcag tatgatgctc ctgatcctgt 840 
gattgacaca tgcaattcat tttactgcga ccaatttaaa ccaatctctc caaacaagcc 900 
caaaatttgg acagagaact ggccgggatg gttcaagaca tttggggcca gagatcctca 960 
caggcctcac gaagatgttg cttattccgt ggctcgtttt ttccaaaaag gaggaagcgt 1020 
gcagaattat tacatgtacc atggtgggac gaactttggc aggacagcag gtggcccttt 1080 
cattaccaca agttatgact atgatgcccc aattgacgaa tatggtttac caaggtttcc 1140
aaaatggggt caccttaaag aacttcataa agtcataaaa tcgtgtgagc atgctctgct 1200 
gaacaatgat ccaactcttc tttcattagg tcctctacaa gaggctgatg tttatgaaga 1260 
tgcttcaggc gcttgtgctg cctttctcgc gaatatggat gacaaaaatg acaaggtggt 1320 
acagttccga catgtatcat accacttgcc agcatggtct gttagcattt tgccagactg 1380 
caaaaatgta gcgttcaaca cagcaaaggt tggatgtcaa acttctattg tcaatatggc 1440 
acccatagat ttgcatccca ccgcaagttc accaaagaga gacatcaagt ctcttcagtg 1500 



ggaagtcttc aaggaaacag ctggagtatg ggcagttgct gatttcacta aaaacggatt 1560 
tgtagatcac attaacacca caaaagatgc tacagactac ctctggtaca caacaagtat 1620 
ttttgttcat gcagaggagg atttcctaag aaacagaggc actgcaatgc ttttcgttga 1680 
atcaaagggt catgctatgc atgtcttcat caataaaaag cttcaagcca gtgcatctgg 1740 
aaatggcaca gtgccacagt tcaagtttgg aactcctatt gctctaaagg cagggaagaa 1800 
tgaaatttcc ttgttaagca tgactgtggg cctacaaaca gctggagcgt tttatgaatg 1860 
gattggagct ggtccaacaa gtgtcaaagt tgcagggttc aagactggga ctatggactt 1920 
gactgcgtct gcttggacct ataagattgg attgcaagca gaacatttga ggatacagaa 1980 
gtcatataac ttgaagagta aaatttgggc accaacttcg cagccaccaa agcaacagcc 2040 
cctcacatgg tataaggcag tagtagatgc gcctcctggt aatgaacctg ttgcacttga 2100 
tatgattcat atgggaaaag gaatggcttg gttgaatgca caagaaattg gcagatattg 2160 
gccgaggaga acttctaaat atgagaattg tgttactcaa tgtgactaca gagccaaatt 2220 
taaccctgat aagtgtgtca ctggctgtgg acaacctaca cagagatggt atcatgtgcc 2280 
acgatcttgg ttcaagccat caggaaatgt cttaattatc tttgaggaaa taggtggaga 2340 
tccctctcaa attagattct caatgcgaaa ggtttctgga gcttgtggtc atctttcagt 2400 
ggaccatcca tcctttgatg ttgaaaatct gcaagcaagt gaaattgaga acgacaaaaa 2460 
cagcccaact ctaagtttca aatgccccac aaatactaat atttcctctg tcaaatttgc 2520 
cagctttgga aatcctaatg gtacatgtgg ctcctacatc ctaggagact gccacgatca 2580 
gaattctgca gcactggtcg aaaaggtttg cctgaaccaa aatgagtgtg cattagaaat 2640 
gtccagcgca aactttaaca tgcaattgtg tccaagtaca gtaaagaaac ttgcagttga 2700 
agtgaattgc agctgagtgt cattgcccaa aatgaatgac atattctaat ttatatagtt 2760 
tgctacggag atgctcattc ttaaaccttt cttatatagc agaaaaatct gctattcctt 2820 
ctttcgtcta tgatttgaag tttaagatat gagtactgat gtcttattaa gcatcaccag 2880 
ataaccttgg atattcatgt ttgaaagact aagtattcat atttattcag tcgagatgca 2940 
agatttattt gtgaaaaaaa aaaaaaaaaa aa 2972 



<210> 8
<211> 835
<212> PRT
<213> Lycopersicon esculentum
<400> 8

Met Gly Phe Trp Met Ala Met Leu Leu Met Leu Leu Leu Cys Leu Trp 
1 5 10 15 

Val Ser Cys Gly Ile Ala Ser Val Ser Tyr Asp His Lys Ala Ile Ile
20 25 30

Val Asn Gly Gln Arg Lys Ile Leu Ile Ser Gly Ser Ile His Tyr Pro
35 40 45

Arg Ser Thr Pro Glu Met Trp Pro Asp Leu Ile Gln Lys Ala Lys Glu
50 55 60

Gly Gly Val Asp Val Ile Gln Thr Tyr Val Phe Trp Asn Gly His Glu
65 70 75 80

Pro Glu Glu Gly Lys Tyr Tyr Phe Glu Glu Arg Tyr Asp Leu Val Lys 
85 90 95 

Phe Ile Lys Val Val Gln Glu Ala Gly Leu Tyr Val His Leu Arg Ile
100 105 110

Gly Pro Tyr Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp
115 120 125

Leu Lys Tyr Val Pro Gly Ile Ser Phe Arg Thr Asn Asn Glu Pro Phe
130 135 140

Lys Ala Ala Met Gln Lys Phe Thr Thr Lys Ile Val Asp Met Met Lys
145 150 155 160

Ala Glu Lys Leu Tyr Glu Thr Gln Gly Gly Pro Ile Ile Leu Ser Gln
165 170 175

Ile Glu Asn Glu Tyr Gly Pro Met Glu Trp Glu Leu Gly Glu Pro Gly
180 185 190

Lys Val Tyr Ser Glu Trp Ala Ala Lys Met Ala Val Asp Leu Gly Thr
195 200 205

Gly Val Pro Trp Ile Met Cys Lys Gln Asp Asp Val Pro Asp Pro Ile
210 215 220

Ile Asn Thr Cys Asn Gly Phe Tyr Cys Asp Tyr Phe Thr Pro Asn Lys
225 230 235 240

Ala Asn Lys Pro Lys Met Trp Thr Glu Ala Trp Thr Ala Trp Phe Thr
245 250 255

Glu Phe Gly Gly Pro Val Pro Tyr Arg Pro Ala Glu Asp Met Ala Phe
260 265 270

Ala Val Ala Arg Phe Ile Gln Thr Gly Gly Ser Phe Ile Asn Tyr Tyr
275 280 285

Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr Ser Gly Gly Pro Phe
290 295 300

Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Phe Gly Ser
305 310 315 320

Leu Arg Gln Pro Lys Trp Gly His Leu Lys Asp Leu His Arg Ala Ile
325 330 335

Lys Leu Cys Glu Pro Ala Leu Val Ser Val Asp Pro Thr Val Thr Ser 
340 345 350 

Leu Gly Asn Tyr Gln Glu Ala Arg Val Phe Lys Ser Glu Ser Gly Ala
355 360 365

Cys Ala Ala Phe Leu Ala Asn Tyr Asn Gln His Ser Phe Ala Lys Val
370 375 380

Ala Phe Gly Asn Met His Tyr Asn Leu Pro Pro Trp Ser Ile Ser Ile
385 390 395 400

Leu Pro Asp Cys Lys Asn Thr Val Tyr Asn Thr Ala Arg Val Gly Ala
405 410 415

Gln Ser Ala Gln Met Lys Met Thr Pro Val Ser Arg Gly Phe Ser Trp
420 425 430

Glu Ser Phe Asn Glu Asp Ala Ala Ser His Glu Asp Asp Thr Phe Thr
435 440 445

Val Val Gly Leu Leu Glu Gln Ile Asn Ile Thr Arg Asp Val Ser Asp
450 455 460

Tyr Leu Trp Tyr Met Thr Asp Ile Glu Ile Asp Pro Thr Glu Gly Phe
465 470 475 480

Leu Asn Ser Gly Asn Trp Pro Trp Leu Thr Val Phe Ser Ala Gly His
485 490 495

Ala Leu His Val Phe Val Asn Gly Gln Leu Ala Gly Thr Val Tyr Gly
500 505 510

Ser Leu Glu Asn Pro Lys Leu Thr Phe Ser Asn Gly Ile Asn Leu Arg
515 520 525

Ala Gly Val Asn Lys Ile Ser Leu Leu Ser Ile Ala Val Gly Leu Pro
530 535 540

Asn Val Gly Pro His Phe Glu Thr Trp Asn Ala Gly Val Leu Gly Pro
545 550 555 560

Val Ser Leu Asn Gly Leu Asn Glu Gly Thr Arg Asp Leu Thr Trp Gln
565 570 575

Lys Trp Phe Tyr Lys Val Gly Leu Lys Gly Glu Ala Leu Ser Leu His 
580 585 590 

Ser Leu Ser Gly Ser Pro Ser Val Glu Trp Val Glu Gly Ser Leu Val 
595 600 605 

Ala Gln Lys Gln Pro Leu Ser Trp Tyr Lys Thr Thr Phe Asn Ala Pro
610 615 620

Asp Gly Asn Glu Pro Leu Ala Leu Asp Met Asn Thr Met Gly Lys Gly
625 630 635 640

Gln Val Trp Ile Asn Gly Gln Ser Leu Gly Arg His Trp Pro Ala Tyr
645 650 655

Lys Ser Ser Gly Ser Cys Ser Val Cys Asn Tyr Thr Gly Trp Phe Asp
660 665 670

Glu Lys Lys Cys Leu Thr Asn Cys Gly Glu Gly Ser Gln Arg Trp Tyr
675 680 685

His Val Pro Arg Ser Trp Leu Tyr Pro Thr Gly Asn Leu Leu Val Val
690 695 700

Phe Glu Glu Trp Gly Gly Asp Pro Tyr Gly Ile Thr Leu Val Lys Arg
705 710 715 720

Glu Ile Gly Ser Val Cys Ala Asp Ile Tyr Glu Trp Gln Pro Gln Leu
725 730 735

Leu Asn Trp Gln Arg Leu Val Ser Gly Lys Phe Asp Arg Pro Leu Arg
740 745 750

Pro Lys Ala His Leu Lys Cys Ala Pro Gly Gln Lys Ile Ser Ser Ile
755 760 765

Lys Phe Ala Ser Phe Gly Thr Pro Glu Gly Val Cys Gly Asn Phe Gln
770 775 780

Gln Gly Ser Cys His Ala Pro Arg Ser Tyr Asp Ala Phe Lys Lys Asn
785 790 795 800

Cys Val Gly Lys Glu Ser Cys Ser Val Gln Val Thr Pro Glu Asn Phe
805 810 815

Gly Gly Asp Pro Cys Arg Asn Val Leu Lys Lys Leu Ser Val Glu Ala 
820 825 830 



Ile Cys Ser
835 



<210> 9
<211> 887
<212> PRT
<213> Lycopersicon esculentum
<400> 9

Ser Arg Arg Lys Thr Leu Asn Phe Pro Leu Ile Leu Thr Val Leu Thr
1 5 10 15

Ile His Phe Val Ile Val Ala Gly Glu Tyr Phe Lys Pro Phe Asn Val
20 25 30

Thr Tyr Asp Asn Arg Ala Leu Ile Ile Gly Gly Lys Arg Arg Met Leu
35 40 45

Ile Ser Ala Gly Ile His Tyr Pro Arg Ala Thr Pro Glu Met Trp Pro
50 55 60

Thr Leu Ile Ala Arg Ser Lys Glu Gly Gly Ala Asp Val Ile Glu Thr
65 70 75 80

Tyr Thr Phe Trp Asn Gly His Glu Pro Thr Arg Gly Gln Tyr Asn Phe
85 90 95

Glu Gly Arg Tyr Asp Ile Val Lys Phe Ala Lys Leu Val Gly Ser His
100 105 110

Gly Leu Phe Leu Phe Ile Arg Ile Gly Pro Tyr Ala Cys Ala Glu Trp
115 120 125

Asn Phe Gly Gly Phe Pro Ile Trp Leu Arg Asp Ile Pro Gly Ile Glu
130 135 140

Phe Arg Thr Asp Asn Ala Pro Phe Lys Glu Glu Met Glu Arg Tyr Val 
145 150 155 160 



Lys Lys Ile Val Asp Leu Met Ile Ser Glu Ser Leu Phe Ser Trp Gln
165 170 175

Gly Gly Pro Ile Ile Leu Leu Gln Ile Glu Asn Glu Tyr Gly Asn Val
180 185 190



Glu Ser Ser Phe Gly Pro Lys Gly Lys Leu Tyr Met Lys Trp Ala Ala 
195 200 205 

Glu Met Ala Val Gly Leu Gly Ala Gly Val Pro Trp Val Met Cys Arg 
210 215 220 

Gln Thr Asp Ala Pro Glu Tyr Ile Ile Asp Thr Cys Asn Ala Tyr Tyr
225 230 235 240

Cys Asp Gly Phe Thr Pro Asn Ser Glu Lys Lys Pro Lys Ile Trp Thr
245 250 255

Glu Asn Trp Asn Gly Trp Phe Ala Asp Trp Gly Glu Arg Leu Pro Tyr 
260 265 270 

Arg Pro Ser Glu Asp Ile Ala Phe Ala Ile Ala Arg Phe Phe Gln Arg
275 280 285

Gly Gly Ser Leu Gln Asn Tyr Tyr Met Tyr Phe Gly Gly Thr Asn Phe
290 295 300

Gly Arg Thr Ala Gly Gly Pro Thr Gln Ile Thr Ser Tyr Asp Tyr Asp
305 310 315 320

Ala Pro Leu Asp Glu Tyr Gly Leu Leu Arg Gln Pro Lys Trp Gly His
325 330 335

Leu Lys Asp Leu His Ala Ala Ile Lys Leu Cys Glu Pro Ala Leu Val
340 345 350

Ala Ala Asp Ser Pro Gln Tyr Ile Lys Leu Gly Pro Lys Gln Glu Ala
355 360 365

His Val Tyr Arg Gly Thr Ser Asn Asn Ile Gly Gln Tyr Met Ser Leu
370 375 380

Asn Glu Gly Ile Cys Ala Ala Phe Ile Ala Asn Ile Asp Glu His Glu
385 390 395 400

Ser Ala Thr Val Lys Phe Tyr Gly Gln Glu Phe Thr Leu Pro Pro Trp
405 410 415

Ser Val Val Phe Cys Gln Ile Ala Glu Ile Gln Leu Ser Thr Gln Leu
420 425 430



Arg Trp Gly His Lys Leu Gln Ser Lys Gln Trp Ala Gln Ile Leu Phe
435 440 445

Gln Leu Gly Ile Ile Leu Cys Phe Tyr Lys Leu Ser Leu Lys Ala Ser
450 455 460

Ser Glu Ser Phe Ser Gln Ser Trp Met Thr Leu Lys Glu Pro Leu Gly
465 470 475 480

Val Trp Gly Asp Lys Asn Phe Thr Ser Lys Gly Ile Leu Glu His Leu
485 490 495

Asn Val Thr Lys Asp Gln Ser Asp Tyr Leu Trp Tyr Leu Thr Arg Ile
500 505 510

Tyr Ile Ser Asp Asp Asp Ile Ser Phe Trp Glu Glu Asn Asp Val Ser
515 520 525

Pro Thr Ile Asp Ile Asp Ser Met Arg Asp Phe Val Arg Ile Phe Val
530 535 540

Asn Gly Gln Leu Ala Gly Ser Val Lys Gly Lys Trp Ile Lys Val Val
545 550 555 560

Gln Pro Val Lys Leu Val Gln Gly Tyr Asn Asp Ile Leu Leu Leu Ser
565 570 575

Glu Thr Val Gly Leu Gln Asn Tyr Gly Ala Phe Leu Glu Lys Asp Gly
580 585 590

Ala Gly Phe Lys Gly Gln Ile Lys Leu Thr Gly Cys Lys Ser Gly Asp
595 600 605

Ile Asn Leu Thr Thr Ser Leu Trp Thr Tyr Gln Val Gly Leu Arg Gly
610 615 620

Glu Phe Leu Glu Val Tyr Asp Val Asn Ser Thr Glu Ser Ala Gly Trp 
625 630 635 640 

Thr Glu Phe Pro Thr Gly Thr Thr Pro Ser Val Phe Ser Trp Tyr Lys 
645 650 655 

Thr Lys Phe Asp Ala Pro Gly Gly Thr Asp Pro Val Ala Leu Asp Phe 
660 665 670 



Ser Ser Met Gly Lys Gly Gln Ala Trp Val Asn Gly His His Val Gly
675 680 685 

Arg Tyr Trp Thr Leu Val Ala Pro Asn Asn Gly Cys Gly Arg Thr Cys 
690 695 700 

Asp Tyr Arg Gly Ala Tyr His Ser Asp Lys Cys Arg Thr Asn Cys Gly 
705 710 715 720 

Glu Ile Thr Gln Ala Trp Tyr His Ile Pro Arg Ser Trp Leu Lys Thr
725 730 735

Leu Asn Asn Val Leu Val Ile Phe Glu Glu Thr Asp Lys Thr Pro Phe
740 745 750

Asp Ile Ser Ile Ser Thr Arg Ser Thr Glu Thr Ile Cys Ala Gln Val
755 760 765

Ser Glu Lys His Tyr Pro Pro Leu His Lys Trp Ser His Ser Glu Phe 
770 775 780 

Asp Arg Lys Leu Ser Leu Met Asp Lys Thr Pro Glu Met His Leu Gln
785 790 795 800

Cys Asp Glu Gly His Thr Ile Ser Ser Ile Glu Phe Ala Ser Tyr Gly
805 810 815

Ser Pro Asn Gly Ser Cys Gln Lys Phe Ser Gln Gly Lys Cys His Ala
820 825 830

Ala Asn Ser Leu Ser Val Val Ser Gln Ala Cys Ile Gly Arg Thr Ser
835 840 845

Cys Ser Ile Gly Ile Ser Asn Gly Val Phe Gly Asp Pro Cys Arg His
850 855 860

Val Val Lys Ser Leu Ala Val Gln Ala Lys Cys Ser Pro Pro Pro Asp
865 870 875 880

Leu Ser Thr Ser Ala Ser Ser 
885 



<210> 10
<211> 838
<212> PRT
<213> Lycopersicon esculentum
<400> 10

Met Gly Cys Thr Leu Ile Leu Met Leu Asn Val Leu Leu Val Leu Leu
1 5 10 15

Gly Ser Trp Val Phe Ser Gly Thr Ala Ser Val Ser Tyr Asp His Arg 
20 25 30 

Ala Ile Ile Val Asn Gly Gln Arg Arg Ile Leu Ile Ser Gly Ser Val
35 40 45

His Tyr Pro Arg Ser Thr Pro Glu Met Trp Pro Gly Ile Ile Gln Lys
50 55 60

Ala Lys Glu Gly Gly Val Asp Val Ile Gln Thr Tyr Val Phe Trp Asn
65 70 75 80

Gly His Glu Pro Gln Gln Gly Lys Tyr Tyr Phe Glu Gly Arg Tyr Asp
85 90 95

Leu Val Lys Phe Ile Lys Leu Val His Gln Ala Gly Leu Tyr Val His
100 105 110

Leu Arg Val Gly Pro Tyr Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe 
115 120 125 

Pro Val Trp Leu Lys Tyr Val Pro Gly Ile Ser Phe Arg Thr Asp Asn
130 135 140

Gly Pro Phe Lys Ala Ala Met Gln Lys Phe Thr Ala Lys Ile Val Asn
145 150 155 160

Met Met Lys Ala Glu Arg Leu Tyr Glu Thr Gln Gly Gly Pro Ile Ile
165 170 175

Leu Ser Gln Ile Glu Asn Glu Tyr Gly Pro Met Glu Trp Glu Leu Gly
180 185 190

Ala Pro Gly Lys Ser Tyr Ala Gln Trp Ala Ala Lys Met Ala Val Gly
195 200 205

Leu Asp Thr Gly Val Pro Trp Val Met Cys Lys Gln Asp Asp Ala Pro
210 215 220

Asp Pro Ile Ile Asn Ala Cys Asn Gly Phe Tyr Cys Asp Tyr Phe Ser
225 230 235 240

Pro Asn Lys Ala Tyr Lys Pro Lys Ile Trp Thr Glu Ala Trp Thr Ala
245 250 255

Trp Phe Thr Gly Phe Gly Asn Pro Val Pro Tyr Arg Pro Ala Glu Asp 
260 265 270 

Leu Ala Phe Ser Val Ala Lys Phe Ile Gln Lys Gly Gly Ser Phe Ile
275 280 285 

Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr Ala Gly 
290 295 300 

Gly Pro Phe Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu
305 310 315 320

Tyr Gly Leu Leu Arg Gln Pro Lys Trp Gly His Leu Lys Asp Leu His
325 330 335

Arg Ala Ile Lys Leu Cys Glu Pro Ala Leu Val Ser Gly Asp Pro Ala
340 345 350

Val Thr Ala Leu Gly His Gln Gln Glu Ala His Val Phe Arg Ser Lys
355 360 365

Ala Gly Ser Cys Ala Ala Phe Leu Ala Asn Tyr Asp Gln His Ser Phe
370 375 380

Ala Thr Val Ser Phe Ala Asn Arg His Tyr Asn Leu Pro Pro Trp Ser 
385 390 395 400 

Ile Ser Ile Leu Pro Asp Cys Lys Asn Thr Val Phe Asn Thr Ala Arg
405 410 415

Ile Gly Ala Gln Ser Ala Gln Met Lys Met Thr Pro Val Ser Arg Gly
420 425 430

Leu Pro Trp Gln Ser Phe Asn Glu Glu Thr Ser Ser Tyr Glu Asp Ser
435 440 445

Ser Phe Thr Val Val Gly Leu Leu Glu Gln Ile Asn Thr Thr Arg Asp
450 455 460

Val Ser Asp Tyr Leu Trp Tyr Ser Thr Asp Val Lys Ile Asp Ser Arg
465 470 475 480

Glu Lys Phe Leu Arg Gly Gly Lys Trp Pro Trp Leu Thr Ile Met Ser
485 490 495

Ala Gly His Ala Leu His Val Phe Val Asn Gly Gln Leu Ala Gly Thr
500 505 510

Ala Tyr Gly Ser Leu Glu Lys Pro Lys Leu Thr Phe Ser Lys Ala Val 
515 520 525 

Asn Leu Arg Ala Gly Val Asn Lys Ile Ser Leu Leu Ser Ile Ala Val
530 535 540

Gly Leu Pro Asn Ile Gly Pro His Phe Glu Thr Trp Asn Ala Gly Val
545 550 555 560 

Leu Gly Pro Val Ser Leu Thr Gly Leu Asp Glu Gly Lys Arg Asp Leu 
565 570 575 

Thr Trp Gln Lys Trp Ser Tyr Lys Val Gly Leu Lys Gly Glu Ala Leu
580 585 590 

Ser Leu His Ser Leu Ser Gly Ser Ser Ser Val Glu Trp Val Glu Gly 
595 600 605 

Ser Leu Val Ala Gln Arg Gln Pro Leu Thr Trp Tyr Lys Ser Thr Phe
610 615 620 

Asn Ala Pro Ala Gly Asn Asp Pro Leu Ala Leu Asp Leu Asn Thr Met 
625 630 635 640 

Gly Lys Gly Gln Val Trp Ile Asn Gly Gln Ser Leu Gly Arg Tyr Trp
645 650 655 

Pro Gly Tyr Lys Ala Ser Gly Asn Cys Gly Ala Cys Asn Tyr Ala Gly 
660 665 670 

Trp Phe Asn Glu Lys Lys Cys Leu Ser Asn Cys Gly Glu Ala Ser Gln
675 680 685 

Arg Trp Tyr His Val Pro Arg Ser Trp Leu Tyr Pro Thr Gly Asn Leu 
690 695 700 

Leu Val Leu Phe Glu Glu Trp Gly Gly Glu Pro His Gly Ile Ser Leu
705 710 715 720

Val Lys Arg Glu Val Ala Ser Val Cys Ala Asp Ile Asn Glu Trp Gln
725 730 735

Pro Gln Leu Val Asn Trp Gln Met Gln Ala Ser Gly Lys Val Asp Lys
740 745 750

Pro Leu Arg Pro Lys Ala His Leu Ser Cys Ala Ser Gly Gln Lys Ile
755 760 765

Thr Ser Ile Lys Phe Ala Ser Phe Gly Thr Pro Gln Gly Val Cys Gly
770 775 780

Ser Phe Arg Glu Gly Ser Cys His Ala Phe His Ser Tyr Asp Ala Phe
785 790 795 800

Glu Arg Tyr Cys Ile Gly Gln Asn Ser Cys Ser Val Pro Val Thr Pro
805 810 815

Glu Ile Phe Gly Gly Asp Pro Cys Pro His Val Met Lys Lys Leu Ser
820 825 830

Val Glu Val Ile Cys Ser
835 



<210> 11
<211> 724
<212> PRT
<213> Lycopersicon esculentum
<400> 11

Met Leu Arg Thr Asn Val Leu Leu Leu Leu Val Ile Cys Leu Leu Asp
1 5 10 15

Phe Phe Ser Ser Val Lys Ala Ser Val Ser Tyr Asp Asp Arg Ala Ile
20 25 30

Ile Ile Asn Gly Lys Arg Lys Ile Leu Ile Ser Gly Ser Ile His Tyr
35 40 45

Pro Arg Ser Thr Pro Gln Met Trp Pro Asp Leu Ile Gln Lys Ala Lys
50 55 60

Asp Gly Gly Leu Asp Val Ile Glu Thr Tyr Val Phe Trp Asn Gly His
65 70 75 80

Glu Pro Ser Pro Gly Lys Tyr Asn Phe Glu Gly Arg Tyr Asp Leu Val
85 90 95

Arg Phe Ile Lys Met Val Gln Arg Ala Gly Leu Tyr Val Asn Leu Arg
100 105 110

Ile Gly Pro Tyr Val Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val
115 120 125

Trp Leu Lys Tyr Val Pro Gly Met Glu Phe Arg Thr Asn Asn Gln Pro
130 135 140

Phe Lys Val Ala Met Gln Gly Phe Val Gln Lys Ile Val Asn Met Met
145 150 155 160

Lys Ser Glu Asn Leu Phe Glu Ser Gln Gly Gly Pro Ile Ile Met Ala
165 170 175

Gln Ile Glu Asn Glu Tyr Gly Pro Val Glu Trp Glu Ile Gly Ala Pro
180 185 190

Gly Lys Ala Tyr Thr Lys Trp Ala Ala Gln Met Ala Val Gly Leu Lys
195 200 205

Thr Gly Val Pro Trp Ile Met Cys Lys Gln Glu Asp Ala Pro Asp Pro
210 215 220

Val Ile Asp Thr Cys Asn Gly Phe Tyr Cys Glu Gly Phe Arg Pro Asn
225 230 235 240

Lys Pro Tyr Lys Pro Lys Met Trp Thr Glu Val Trp Thr Gly Trp Tyr 
245 250 255 

Thr Lys Phe Gly Gly Pro Ile Pro Gln Arg Pro Ala Glu Asp Ile Ala
260 265 270

Phe Ser Val Ala Arg Phe Val Gln Asn Asn Gly Ser Phe Phe Asn Tyr
275 280 285 

Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr Ser Ser Gly Leu 
290 295 300 

Phe Ile Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr Gly
305 310 315 320

Leu Leu Asn Glu Pro Lys Tyr Gly His Leu Arg Asp Leu His Lys Ala
325 330 335

Ile Lys Leu Ser Glu Pro Ala Leu Val Ser Ser Tyr Ala Ala Val Thr
340 345 350

Ser Leu Gly Ser Asn Gln Glu Ala His Val Tyr Arg Ser Lys Ser Gly
355 360 365

Ala Cys Ala Ala Phe Leu Ser Asn Tyr Asp Ser Arg Tyr Ser Val Lys
370 375 380

Val Thr Phe Gln Asn Arg Pro Tyr Asn Leu Pro Pro Trp Ser Ile Ser
385 390 395 400

Ile Leu Pro Asp Cys Lys Thr Ala Val Tyr Asn Thr Ala Gln Val Asn
405 410 415

Ser Gln Ser Ser Ser Ile Lys Met Thr Pro Ala Gly Gly Gly Leu Ser
420 425 430

Trp Gln Ser Tyr Asn Glu Glu Thr Pro Thr Ala Asp Asp Ser Asp Thr
435 440 445

Leu Thr Ala Asn Gly Leu Trp Glu Gln Lys Asn Val Thr Arg Asp Ser
450 455 460

Ser Asp Tyr Leu Trp Tyr Met Thr Asn Val Asn Ile Ala Ser Asn Glu
465 470 475 480 

Gly Phe Leu Lys Asn Gly Lys Asp Pro Tyr Leu Thr Val Met Ser Ala 
485 490 495 

Gly His Val Leu His Val Phe Val Asn Gly Lys Leu Ser Gly Thr Val 
500 505 510 

Tyr Gly Thr Leu Asp Asn Pro Lys Leu Thr Tyr Ser Gly Asn Val Lys 
515 520 525 

Leu Arg Ala Gly Ile Asn Lys Ile Ser Leu Leu Ser Val Ser Val Gly
530 535 540 

Leu Pro Asn Val Gly Val His Tyr Asp Thr Trp Asn Ala Gly Val Leu 
545 550 555 560 

Gly Pro Val Thr Leu Ser Gly Leu Asn Glu Gly Ser Arg Asn Leu Ala 
565 570 575 

Lys Gln Lys Trp Ser Tyr Lys Val Gly Leu Lys Gly Glu Ser Leu Ser
580 585 590

Leu His Ser Leu Ser Gly Ser Ser Ser Val Glu Trp Val Arg Gly Ser 
595 600 605 

Leu Met Ala Gln Lys Gln Pro Leu Thr Trp Tyr Lys Ala Thr Phe Asn
610 615 620 

Ala Pro Gly Gly Asn Asp Pro Leu Ala Leu Asp Met Ala Ser Met Gly 
625 630 635 640 

Lys Gly Gln Ile Trp Ile Asn Gly Glu Gly Val Gly Arg His Trp Pro
645 650 655

Gly Tyr Ile Ala Gln Gly Asp Cys Ser Lys Cys Ser Tyr Ala Gly Thr
660 665 670

Phe Asn Glu Lys Lys Cys Gln Thr Asn Cys Gly Gln Pro Ser Gln Arg
675 680 685

Trp Tyr His Val Pro Arg Ser Trp Leu Lys Pro Ser Gly Asn Leu Leu
690 695 700

Val Val Phe Glu Glu Trp Gly Gly Asn Pro Thr Gly Ile Ser Leu Val
705 710 715 720 

Arg Arg Ser Arg 



<210> 12
<211> 251
<212> PRT
<213> Lycopersicon esculentum
<400> 12

He Gin Thr Tyr Val Phe Trp Asn Leu His Glu Pro Val Arg Asn Gin 
15 10 15 

Tyr Asp Phe Glu Gly Arg Lys Asp Leu He Asn Phe Val Lys Leu Val 
20 25 30 

Glu Arg Ala Gly Leu Phe Val His Ile Arg Ile Gly Pro Tyr Val Cys
35 40 45 

Ala Glu Trp Asn Tyr Gly Gly Phe Pro Leu Trp Leu His Phe Ile Pro
50 55 60

Gly Ile Glu Phe Arg Thr Asp Asn Glu Pro Phe Lys Ala Glu Met Lys
65 70 75 80 

Arg Phe Thr Ala Lys Ile Val Asp Met Ile Lys Gln Glu Asn Leu Tyr
85 90 95 

Ala Ser Gln Gly Gly Pro Val Ile Leu Ser Gln Ile Glu Asn Glu Tyr
100 105 110 

Gly Asn Gly Asp Ile Glu Ser Arg Tyr Gly Pro Arg Ala Lys Pro Tyr
115 120 125 

Val Asn Trp Ala Ala Ser Met Ala Thr Ser Leu Asn Thr Gly Val Pro 
130 135 140 

Trp Val Met Cys Gln Gln Pro Asp Ala Pro Pro Ser Val Ile Asn Thr
145 150 155 160 

Cys Asn Gly Phe Tyr Cys Asp Gln Phe Lys Gln Asn Ser Asp Lys Thr
165 170 175 

Pro Lys Met Trp Thr Glu Asn Trp Thr Gly Trp Phe Leu Ser Phe Gly 
180 185 190 

Gly Phe Val Pro Tyr Arg Pro Val Glu Asp Ile Ala Phe Ala Val Ala
195 200 205 

Arg Phe Phe Gln Arg Gly Gly Thr Phe Gln Asn Tyr Tyr Met Tyr His
210 215 220 

Gly Gly Thr Asn Phe Gly Arg Thr Ser Gly Gly Pro Phe Ile Ala Thr
225 230 235 240 

Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 
245 250 



<210> 13 

<211> 249

<212> PRT

<213> Lycopersicon esculentum 
<400> 13 

Ile Gln Thr Tyr Val Phe Trp Asn Val His Glu Pro Ser Pro Gly Asn
1 5 10 15

Tyr Asn Phe Glu Gly Arg Tyr Asp Leu Val Arg Phe Val Lys Thr Ile
20 25 30 

Gln Lys Ala Gly Leu Tyr Ala His Leu Arg Ile Gly Pro Tyr Val Cys
35 40 45 

Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro
50 55 60 

Gly Ile Ser Phe Arg Ala Asp Asn Glu Pro Phe Lys Asn Ala Met Lys
65 70 75 80 

Gly Tyr Ala Glu Lys Ile Val Asn Leu Met Lys Ile Ile Ile Phe Ser
85 90 95 

Ser Leu Arg Val Val Gln Ser Tyr Ser His Arg Leu Arg Met Ser Met
100 105 110 

Gly Leu Lys Pro Arg Tyr Leu Glu His Arg Asp Ile Ser Ile Gln His
115 120 125 

Gly Leu Gln Ile Trp Gln Leu Asp Leu Asn Thr Gly Val Pro Trp Val
130 135 140 

Met Cys Lys Glu Glu Asp Ala Pro Asp Pro Val Ile Asn Thr Cys Asn
145 150 155 160 

Gly Phe Tyr Cys Asp Asn Phe Phe Pro Asn Lys Pro Tyr Lys Pro Ala 
165 170 175 

Ile Trp Thr Glu Ala Trp Ser Gly Trp Phe Ser Glu Phe Gly Gly Pro
180 185 190 

Leu His Gln Arg Pro Val Gln Asp Leu Ala Phe Ala Val Ala Gln Phe
195 200 205 

Ile Gln Arg Gly Gly Ser Phe Val Asn Tyr Tyr Met Tyr His Gly Gly
210 215 220 

Thr Asn Phe Gly Arg Thr Ala Gly Gly Pro Phe Ile Thr Thr Ser Tyr
225 230 235 240 

Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 
245 



<210> 14 

<211> 870

<212> PRT

<213> Lycopersicon esculentum 
<400> 14 

Met Asn Thr Met Ser Cys Leu Ser Ser Asn Phe Lys Phe Val Phe Leu 
1 5 10 15

Ala Ser Thr Val Ile Trp Met Thr Val Met Ser Ser Ser Leu Ala Ala
20 25 30 

Val Asp Ala Ser Asn Val Thr Thr Ile Gly Thr Asp Ser Val Thr Tyr
35 40 45 

Asp Arg Arg Ser Leu Ile Ile Asn Gly Gln Arg Lys Leu Leu Ile Ser
50 55 60 

Ala Ser Ile His Tyr Pro Arg Ser Val Pro Ala Met Trp Pro Gly Leu
65 70 75 80

Val Arg Leu Ala Lys Glu Gly Gly Val Asp Val Ile Glu Thr Tyr Val
85 90 95 

Phe Trp Asn Gly His Glu Pro Ser Pro Gly Asn Tyr Tyr Phe Gly Gly 
100 105 110 

Arg Phe Asp Leu Val Lys Phe Cys Lys Ile Ile Gln Gln Ala Gly Met
115 120 125 

Tyr Met Ile Leu Arg Ile Gly Pro Phe Val Ala Ala Glu Trp Asn Phe
130 135 140 

Gly Gly Leu Pro Val Trp Leu His Tyr Val Pro Gly Thr Thr Phe Arg 
145 150 155 160 

Thr Asp Ser Glu Pro Phe Lys Tyr His Met Gln Lys Phe Met Thr Tyr
165 170 175 

Thr Val Asn Leu Met Lys Arg Glu Arg Leu Phe Ala Ser Gln Gly Gly
180 185 190 

Pro Ile Ile Leu Ser Gln Val Glu Asn Glu Tyr Gly Tyr Tyr Glu Asn
195 200 205

Ala Tyr Gly Glu Gly Gly Lys Arg Tyr Ala Leu Trp Ala Ala Lys Met
210 215 220 

Ala Leu Ser Gln Asn Thr Gly Val Pro Trp Ile Met Cys Gln Gln Tyr
225 230 235 240 

Asp Ala Pro Asp Pro Val Ile Asp Thr Cys Asn Ser Phe Tyr Cys Asp
245 250 255 

Gln Phe Lys Pro Ile Ser Pro Asn Lys Pro Lys Ile Trp Thr Glu Asn
260 265 270 

Trp Pro Gly Trp Phe Lys Thr Phe Gly Ala Arg Asp Pro His Arg Pro 
275 280 285 

Ala Glu Asp Val Ala Tyr Ser Val Ala Arg Phe Phe Gln Lys Gly Gly
290 295 300 

Ser Val Gln Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg
305 310 315 320 

Thr Ala Gly Gly Pro Phe Ile Thr Thr Ser Tyr Asp Tyr Asp Ala Pro
325 330 335 

Ile Asp Glu Tyr Gly Leu Pro Arg Phe Pro Lys Trp Gly His Leu Lys
340 345 350 

Glu Leu His Lys Val Ile Lys Ser Cys Glu His Ala Leu Leu Asn Asn
355 360 365 

Asp Pro Thr Leu Leu Ser Leu Gly Pro Leu Gln Glu Ala Asp Val Tyr
370 375 380 

Glu Asp Ala Ser Gly Ala Cys Ala Ala Phe Leu Ala Asn Met Asp Asp 
385 390 395 400 

Lys Asn Asp Lys Val Val Gln Phe Arg His Val Ser Tyr His Leu Pro
405 410 415 

Ala Trp Ser Val Ser Ile Leu Pro Asp Cys Lys Asn Val Ala Phe Asn
420 425 430 

Thr Ala Lys Val Gly Cys Gln Thr Ser Ile Val Asn Met Ala Pro Ile
435 440 445

Asp Leu His Pro Thr Ala Ser Ser Pro Lys Arg Asp Ile Lys Ser Leu
450 455 460

Gln Trp Glu Val Phe Lys Glu Thr Ala Gly Val Trp Gly Val Ala Asp
465 470 475 480 

Phe Thr Lys Asn Gly Phe Val Asp His Ile Asn Thr Thr Lys Asp Ala
485 490 495 

Thr Asp Tyr Leu Trp Tyr Thr Thr Ser Ile Phe Val His Ala Glu Glu
500 505 510 

Asp Phe Leu Arg Asn Arg Gly Thr Ala Met Leu Phe Val Glu Ser Lys 
515 520 525 

Gly His Ala Met His Val Phe Ile Asn Lys Lys Leu Gln Ala Ser Ala
530 535 540 

Ser Gly Asn Gly Thr Val Pro Gln Phe Lys Phe Gly Thr Pro Ile Ala
545 550 555 560 

Leu Lys Ala Gly Lys Asn Glu Ile Ser Leu Leu Ser Met Thr Val Gly
565 570 575 

Leu Gln Thr Ala Gly Ala Phe Tyr Glu Trp Ile Gly Ala Gly Pro Thr
580 585 590 

Ser Val Lys Val Ala Gly Phe Lys Thr Gly Thr Met Asp Leu Thr Ala 
595 600 605 

Ser Ala Trp Thr Tyr Lys Ile Gly Leu Gln Gly Glu His Leu Arg Ile
610 615 620 

Gln Lys Ser Tyr Asn Leu Lys Ser Lys Ile Trp Ala Pro Thr Ser Gln
625 630 635 640 

Pro Pro Lys Gln Gln Pro Leu Thr Trp Tyr Lys Ala Val Val Asp Ala
645 650 655 

Pro Pro Gly Asn Glu Pro Val Ala Leu Asp Met Ile His Met Gly Lys
660 665 670 

Gly Met Ala Trp Leu Asn Gly Gln Glu Ile Gly Arg Tyr Trp Pro Arg
675 680 685

Arg Thr Ser Lys Tyr Glu Asn Cys Val Thr Gln Cys Asp Tyr Arg Gly
690 695 700

Lys Phe Asn Pro Asp Lys Cys Val Thr Gly Cys Gly Gln Pro Thr Gln
705 710 715 720 

Arg Trp Tyr His Val Pro Arg Ser Trp Phe Lys Pro Ser Gly Asn Val
725 730 735 

Leu Ile Ile Phe Glu Glu Ile Gly Gly Asp Pro Ser Gln Ile Arg Phe
740 745 750

Ser Met Arg Lys Val Ser Gly Ala Cys Gly His Leu Ser Val Asp His
755 760 765

Pro Ser Phe Asp Val Glu Asn Leu Gln Gly Ser Glu Ile Glu Asn Asp
770 775 780

Lys Asn Arg Pro Thr Leu Ser Leu Lys Cys Pro Thr Asn Thr Asn Ile
785 790 795 800

Ser Ser Val Lys Phe Ala Ser Phe Gly Asn Pro Asn Gly Thr Cys Gly
805 810 815

Ser Tyr Met Leu Gly Asp Cys His Asp Gln Asn Ser Ala Ala Leu Val
820 825 830

Glu Lys Val Cys Leu Asn Gln Asn Glu Cys Ala Leu Glu Met Ser Ser
835 840 845

Ala Asn Phe Asn Met Gln Leu Cys Pro Ser Thr Val Lys Lys Leu Ala
850 855 860

Val Glu Val Asn Cys Ser
865 870 



